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1 Introduction 



Certain spectral statistics of broad classes of iV x random matrix ensembles arc believed to 
follow a universal behavior in the limit iV — > oo. Wigner has observed [30] that the density 
of eigenvalues of large symmetric or hermitian matrices H with independent entries (up to the 
symmetry requirement) converges, as N ^ oo, to a universal density, the Wigner semicircle law. 
Dyson has observed that the local correlation statistics of neighboring eigenvalues inside the bulk 
of the spectrum follows another universal pattern, the Dyson sine- kernel in the TV — >■ oo limit [10]. 
Moreover, any fc-point correlation function can be obtained as a determinant of the two point 
correlation functions. The precise form of the universal two point function in the bulk seems to 
depend only on the symmetry class of the matrix ensemble (a different universal behavior emerges 
near the spectral edge [28]). 

Dyson has proved this fact for the Gaussian Unitary Ensemble (GUE), where the matrix ele- 
ments arc independent, identically distributed complex Gaussian random variables (subject to the 
hermitian constraint). A characteristic feature of GUE is that the distribution is invariant under 
unitary conjugation, H — > U* HU for any unitary matrix U . Dyson found an explicit formula for 
the joint density function of the N eigenvalues. The formula contains a characteristic Vandermonde 
determinant and therefore it coincides with the Gibbs measure of a particle system interacting via 
a logarithmic potential analogously to the two dimensional Coulomb gas. Dyson also observed that 
the computation of two point hmction can be reduced to asymptotics of Hermite polynomials. 

His approach has later been substantially generalized to include a large class of random matrix 
ensembles, but always with unitary (orthogonal, symplectic, etc.) invariance. For example, a 
general class of invariant ensembles can be given by the measure cxp(— Tr y(_ff ))di7 on the 
space of hermitian matrices, where AH stands for the Lcbesgue measure for all independent matrix 
entries, Z is the normalization and y is a real function with certain smoothness and growth 
properties. For example, the GUE ensemble corresponds to V{x) = . 

The joint density function is explicit in all these cases and the evaluation of the two point 
function can be reduced to certain asymptotic properties of orthogonal polynomials with respect 
to the weight function exp(— y(a;)) on the real line. The sine kernel can thus be proved for a wide 
range of potentials V . Since the references in this direction are enormous, we can only refer the 
reader to the book by Deift [9] for the Riemann-Hilbert approach, the paper by Levin and Lubinsky 
[23] and references therein for approaches based on classical analysis of orthogonal polynomials, 
or the paper by Pastur and Shcherbina [26] for a probabilistic/statistical physics approach. The 
book by Anderson ct al [1] or the book by Metha [25] also contain extensive lists of literatures. 

Since the computation of the explicit formula of the joint density relies on the unitary invariance, 
there have been very little progress in understanding non-unitary invariant ensembles. The most 
prominent example is the Wigner ensemble or Wigner matrices, i.e., hermitian random matrices 
with i.i.d. entries. Wigner matrices are not unitarily invariant unless the single entry distribution 
is Gaussian, i.e. for the GUE case. The disparity between our understanding of the Wigner 
ensembles and the unitary invariant ensembles is startling. Up until the very recent work of [14], 
there was no proof that the density follows the semicircle law in small spectral windows unless 
the number of eigenvalues in the window is at least ^/N . This is entirely due to a serious lack of 
analytic tools for studying eigenvalues once the mapping between eigenvalues and Coulomb gas 
ceases to apply. At present, there are only two rigorous approaches to eigenvalue distributions: the 
moment method and Green function method. The moment method is restricted to studying the 
spectrum near the edges [28] ; the precision of the Green function method seems to be still very far 
from getting information on level spacing [6] . 
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Beyond the unitary ensembles, Johansson [21] proved the sine-kernel for a broader category of 
ensembles, i.e., for matrices of the form H + sV where if is a Wigner matrix, V is an independent 
GUE matrix and s is a positive constant of order one. (Strictly speaking, in the original work 
[21], the range of the parameter s depends on the energy E. This restriction was later removed by 
Ben Arous and Peche [3], who also extended this approach to Wishart ensembles). Alternatively 
formulated, if the matrix elements are normalized to have variance one, then the distribution of 
the matrix elements of the ensemble H + sV is given by ly * Gs, where i/ is the distribution of the 
Wigner matrix elements and Gs is the centered Gaussian law with variance s^. Johasson's work 
is based on the analysis of the explicit formula for the joint eigenvalue distribution of the matrix 
H + sV (see also [7]). 

Dyson has introduced a dynamical version of generating random matrices. He considered a 
matrix-valued process H -\- sV where is a matrix-valued Brownian motion. The distribution 
of the eigenvalues then evolves according to a process called Dyson's Brownian motions. For the 
convenience of analysis, we replace the Brownian motions by an Ornstein-Uhlenbeck process so that 
the distribution of GUE is the invariant measure of this modified process, which we still call Dyson's 
Brownian motion. Dyson's Brownian motion thus can be viewed as a reversible interacting particle 
system with a long range (logarithmic) interaction. This process is well adapted for studying the 
evolution of the empirical measures of the eigenvalues, see [18]. The sine kernel, on the other hand, 
is a very detailed property which typically cannot be obtained from considerations of interacting 
particle systems. The Hamiltonian for GUE, however, is strictly convex and thus the Dyson's 
Brownian motion satisfies the logarithmic Sobolev inequality (LSI) . It was noted in the derivation 
of the Navier-Stokes equations [12, 27] that the combination of the Guo-Papanicolaou-Varadhan 
[20] approach and LSI provides very detailed estimates on the dynamics. 

The key observation of the present paper is that this method can also be used to estimate 
the approach to local equilibria so precisely that, after combining it with existing techniques from 
orthogonal polynomials, the Dyson sine kernel emerges. In pursuing this approach, we face two 
major obstacles: 1. Good estimate of the initial entropy, 2. Good understanding of the structure of 
local equilibria. It turns out that the initial entropy can be estimated using the explicitly formula 
for the transition kernel of the Dyson's Brownian motion (see [7] and [21]) provided strong inputs 
on the local semicircle law [14] and level repulsion [15] are available. 

The structure of local equilibria, however, is much harder to analyze. Typically, the local equi- 
librium measures are finite volume Gibbs measures with short range interaction and the boundary 
effects can be easily dealt with in the high temperature phase. In the GUE case, the logarithmic 
potential does not even decay at large distance and the equilibrium measure can depend critically 
on the boundary conditions. The theory of orthogonal polynomials provides explicit formulae for 
the correlation functions of this highly correlated Gibbs measure. These formulae can be effectively 
analyzed if the external potential (or logarithm of the weight function in the terminology of the 
orthogonal polynomials) is very well understood. Fortunately, we have proved the local semicircle 
law up to scales of order 1/A^ and the level repulsion, which can be used to control the bound- 
ary effects. By invoking the theorem of Levin and Lubinsky [23] and the method of Pastur and 
Shcherbina [26] we arc led to the sine kernel. 

It is easy to see that adding a Gaussian component of size much smaller than to the 

original Wigner matrix would not move the eigenvalues sufficiently to change the local statistics. 
Our requirement that the Gaussian component is at least of size N~'^/^ comes from technical 
estimates to control the initial global entropy and it does not have any intrinsic meaning. The 
case that the variance is of order N^^, however, is an intrinsic barrier which is difficult to cross. 
Nevertheless, we believe that our method may offer a possible strategy to prove the universality of 
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sine kernel for general Wigner matrices. 

After this manuscript had been completed, we found a different approach to prove the Dyson 
sine kernel [16], partly based on a contour integral representation for the two-point correlation 
function [7, 21]. Shortly after our manuscripts were completed, we learned that our main result 
was also obtained by Tao and Vu in [29] with a different method under no regularity conditions on 
the initial distribution v provided the third moment of u vanishes. 

Although the results in this paper are weaker than those in [16] and [29], we believe that the 
method presented here has certain independent interest. Unlike [16] and [29], this approach does 
not use the contour integral representation of the two point correlation function. Hence, it may 
potentially have a broader applicability to other matrix ensembles for which such representation 
is not available. 

Acknowledgements. We would like to thank the referees for suggesting several improvements of 
the presentation. 

2 Main theorem and conditions 

Fix e N and we consider a Hermitian matrix ensemble of N x N matrices H — {hik) with the 
normalization 

hik = N^^^^Zik, zik = xik + iyik, (2.1) 

where X(k, yik for i < k are independent, identically distributed random variables with distribution 
ly = that has zero expectation and variance ^. The diagonal elements are real, i.e. yu = 
and and xu are also i.i.d., independent from the off-diagonal ones with distribution v = i/^^^ that 
has zero expectation and variance one. The superscript indicating the A^-dependence of v, v will 
be omitted. 

We assume that the probability measures v and v have a small Gaussian component of variance 
where /3 > is some fixed positive number. More precisely, we assume there exist 
probability measures vq and vq with zero expectation and variance i and 1, respectively, such that 

V ^Vs^G^/^, v^Vs^Gs, (2.2) 

where Gs{x) = (27rs)~^ exp(— a;^/2s) is the Gaussian law with variance and Vg, Vs are the 
rescaling of the laws I'o, ?o to ensure that v and v have variance 1/2 and 1; i.e, explicitly 

v,{Ax) = (1 - s'')-^'^yo{dx{l - s2)-i/2)^ ^^^(dx) = (1 - s2)-i/2i7o(dx(l - s^)~^/^). 

This requirement is equivalent to considering random matrices of the form 

H ={l- s^f/'^H + sV, (2.3) 

where H is a, Wigner matrix with single entry distribution vq and vq, and is a GUE matrix 
whose elements are centered Gaussian random variables with variance 1/A^. 

Furthermore, we assume that v is absolutely continuous with positive density functions h{x) > 
0, i.e. we can write it as dv[x) = h{x)dx = exjp{—g{x))dx with some real function g. We assume 
the following conditions: 
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• The measure di^ satisfies the logarithmic Sobolev inequahty, i.e. there exists a constant S 
such that 

[ u\ogudi^<S [ iVVwpdz/ (2.4) 

holds for any density function m > with J udv = 1. 

• The Fourier transform of the functions h and h{Ag) satisfy the decay estimates 

\h{t, s)\ < -9 , \hAg{t, s)\ < — — -9 (2.5) 

with some constants uj,u} > 0. 

• There exists a iJq > such that for the distribution of the diagonal elements 

Do := J cxp [(5oa;^] di/(a;) < oo . (2.6) 

Although the conditions arc stated directly for the measures v and v, it is easy to see that it is 
sufficient to assume that i/q satisfies (2.4) and (2.5) and Vq satisfies (2.6). Wc remark that (2.4) 
implies that (2.6) holds for i/ instead oily as well (see [22]). 

The eigenvalues of H are denoted by Ai , A2 , . . . Ajv • The law of the matrix ensemble in- 
duces a probability measure on the set of eigenvalues whose density function will be denoted 
by p{Xi, A2, . . . , Aat). The eigenvalues are considered unordered for the moment and thus p is a 
symmetric function. For any fc = 1, 2, . . . , A'^, let 

p^'')(Ai,A2,...Afc) / p(Ai,A2,...,AAr)dAfc+i ...dAAT 

be the fc-point correlation function of the eigenvalues. The fc = 1 point correlation function 
(density) is denoted by g{X) := ]3'^'(A). With our normalization convention, the density g{X) is 
supported in [—2, 2] and in the N 00 limit it converges to the Wigner semicircle law given by 
the density 

Qscix) = ^v/4-a;2l[_2,2](-T). (2.7) 
The main result of this paper is the following theorem: 

Theorem 2.1 Fix arbitrary positive constants /? > and k > 0. Consider the Wigner matrix 
ensemble with a Gaussian convolution of variance ~ ]\[-3/4+i3 giy^^ i)y (2.3) and assume (2.4)- 
(2.6). Let p^^^ be the two point correlation function of the eigenvalues of this ensemble. Let 
\Eq\ <2 — k and 

0{a,b)^g(a-b)h{^) (2.8) 
with g, h smooth and compactly supported functions such that h > and J h = 1. Then we have 



lim lim ^ [ dE [ f dadbOia.b)^^ p^^Ue + ^^,E 

s^0N^<^2S Je,-s J J PsciE) V Psc{E)N 



b 



sin7r7i\2] 

du. 
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The factor g in the observable (2.8) tests the eigenvalue differences. The factor ft,, that dis- 
appears in the right hand side of (2.9), is only a normalization factor. Thus the special form of 
observable (2.8) directly exhibits the fact that the local statistics is translation invariant. 

Conventions. All integrations with unspecified domains are on R. We will use the letters C 
and c to denote general constants whose precise values are irrelevant and they may change from 
line to line. These constants may depend on the constants in (2.4)-(2.6). 



2.1 Outline of the proof 

Our approach has three main ingredients. In the first step, we use the entropy method from 
hydrodynamical limits to establish a local equilibrium of the eigenvalues in a window of size 
iV~^+'^ (with some small e > 0), i.e. window that typically contains n = eigenvalues. This 
local equilibrium is subject to an external potential generated by all other eigenvalues. In the 
second step we then prove that the density of this equilibrium measure is locally constant by using 
methods from orthogonal polynomials. Finally, in the third step, we employ a recent result [23] to 
deduce the sine-kernel. We now describe each step in more details. 

Step 1. 

We generate the Wigner matrix with a small Gaussian component by running a matrix-valued 
Ornstein-Uhlenbeck process (3.1) for a short time of order t ^ N~^, C > 0. This generates a 
stochastic process for the eigenvalues which can be described as Ornstein-Uhlenbeck processes for 
the individual eigenvalues with a strong interaction (3.10). 

This process is the celebrated Dyson's Brownian motion (DBM) [11] and the equilibrium mea- 
sure is the GUE distribution of eigenvalues. The transition kernel can be computed explicitly (5.12) 
and it contains the determinantal structure of the joint probability density of the GUE eigenvalues 
that is responsible for the sine- kernel. This kernel was analyzed by Johansson [21] assuming that 
the time t is of order one, which is the same order as the relaxation time to equilibrium for the 
Dyson's Brownian motions. The sine-kernel, however, is a local statistics, and local equilibrium 
can be reached within a much shorter time scale. To implement this idea, we first control the 
global entropy on time scale N"^ by A^^+", with a > 1/4 (Section 5.2). 

More precisely, recall that the entropy of //i with respect to a probability measure /i is given 

by 

s{f) = s^if) s{f^l\^l) - / /(iog/)d/i. 



In our application, the measure /x is the Gibbs measure for the equilibrium distribution of the 
(ordered) eigenvalues of the GUE, given by the Hamiltonian 



H{X) = N 



N .2 



2 N 

i—1 i<j 



(2.10) 



If ft denotes the joint probability density of the eigenvalues at the time t with respect to fi, then 
the evolution of ft is given by the equation 

dtft = Lft, (2.11) 
where the generator L is defined via the Dirichlet form 

N 
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The evolution of the entropy is given by the equation 




the transition kernel of (2.11) and several inputs from our previous papers [13, 14, 15] on the local 
semicircle law and on the level repulsion for general Wigner matrices. We need to strengthen some 
of these inputs; the new result will be presented in Section 4 with proofs deferred to Appendix A, 
Appendix B and Appendix C. 

It is natural to think of each eigenvalue as a particle and we will use the language of interacting 
particle systems. We remark that the entropy per particle is typically of order one in the interacting 
particle systems. But in our setting, due to the factor N in front of the Hamiltonian (2.10), the 
typical size of entropy per particle is of order N. Thus for a system bearing little relation to the 
equilibrium measure fi, we expect the total entropy to be 0{N'^). So the bound (2.12) already 
contains nontrivial information. However, we believe that one should be able to improve this bound 
to a ~ and the additional a > 1/4 power in (2.12) is only for technical reasons. This is the main 
reason why our final result holds only for a Gaussian convolution with variance larger than N~^^^. 
The additional factor originates from Lemma 5.3 where we approximate the Vandermonde 
determinant appearing in the transition kernel by estimating the fluctuations around the local 
semicircle law. We will explain the origin of a > 1/4 in the beginning of Appendix D where the 
proof of Lemma 5.3 is given. 

From the initial entropy estimate, it follows that the time integration of the Dirichlet form 
is bounded by the initial entropy. For the DBM, due to convexity of the Hamiltonian of the 
equilibrium measure ^, the Dirichlet form is actually decreasing. Thus for t ~ tN^^ with some 
T > 2 we have 



The last estimate says that the Dirichlet form per particle is bounded by A^^+"r^^. So if we take 
an interval of n particles (with coordinates given by x = (xi, . . . , a;„)), then on average the total 
Dirichlet form of these particles is bounded by nA^^+"T~^. We will choose n = N'^ with some very 
small e > 0. As always in the hydrodynamical limit approach, we consider the probability law of 
these n particles given that all other particles (denoted by y) are fixed. Denote by /^y(dx) the 
equilibrium measure of x given that the coordinates of the other N — n particles y are fixed. Let 
/y.t be the conditional density of ft w.r.t. /iy(dx) with y given. The Hamiltonian of the measure 
/iy(dx) is given by 




i=l 



l<i<j<n 



k i=l 



and it satisfies the convexity estimate 



HcssHy(x) >Y,\^^yk\~^- 



k 

If y are regularly distributed, we have the convexity bound 

Hess'Hy(x) > — —. 
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This implies the logarithmic Sobolev inequality 



where in the last estimate some additional n-factors were needed to convert the local Dirichlet 
form estimate per particle on average to an estimate that holds for a typical particle. Thus we 
obtain 



provided we choose t = N^^t = N^~^ with (3 > lOe + a (Section 6). The last inequality asserts 
that the two measures fyfJ-y and fiy are almost the same and thus we only need to establish the 
sine kernel for the measure fiy. At this point, we remark that this argument is valid only if y 
is regularly distributed in a certain sense which we will call good configurations (Definition 4.1). 
Precise estimates on the local semicircle law can be used to show that most external configurations 
are good. Although the rigorous treatment of the good configurations and estimates on the bad 
configurations occupy a large part of this paper, it is of technical nature and we deferred the proofs 
of several steps to the appendices. 



In Sections 8, 9 and 10, wc refine the precision on the local density and prove that the density 
is essentially constant pointwisc. Direct probabilistic arguments to establish the local semicircle 
law in [15] rely on the law of large numbers and they give information on the density on scales 
of much larger than A^"^, i.e. on scales that contain many eigenvalues. The local equilibrium is 
reached in a window of size n/N and within this window, we can conclude that the local semicircle 
law holds on scales of size n'^ /N with an arbitrary small 7 > 0. However, this still does not control 
the density pointwise. To get this information, we need to use orthogonal polynomials. 

The density in local equilibrium can be expressed in terms of sum of squares of orthogonal 
polynomials pi{x),p2{x), . . . with respect to the weight function exp {—nUy{x)) generated by the 
external configuration y (see Section 8 for precise definitions). To get a pointwise bound from 
the appropriate bound on average, we need only to control the derivative of the density, that, 
in particular, can be expressed in terms of derivatives of the orthogonal polynomials pk- Using 
integration by parts and orthogonality properties of pk , it is possible to control the norm of 
pj, in terms of the norm of pk{x)Uy{x). Although the derivative of the potential is singular, 
llPfcC^ylU can be estimated by a Schwarz inequality at the expense of treating higher norms of 
Pk (Lemma 8.1). In this content, we will exploit the fact that we are dealing with polynomials 
by using the Nikolskii inequality which estimates higher norms in terms of lower ones at the 
expense of a constant depending on the degree. To avoid a very large constant in the Nikolskii 
inequality, in Section 7 we first cutoff the external potential and thus we reduce the degree of the 
weight function. 

We remark that our approach of using orthogonal polynomials to control the density pointwise 
was motivated by the work of Pastur and Shcherbina [26] , where they proved sine- kernel for unitary 
invariant matrix ensembles with a three times differentiable potential function on the real line. In 
our case, however, the potential is determined by the external points and it is logarithmically 
divergent near the edges of the window. 



Finally, in Section 11, we complete the proof of the sine-kernel by applying the main theorem 
of [23]. This result establishes the sine-kernel for orthogonal polynomials with respect to an n- 
dependent sequence of weight functions under general conditions. The most serious condition to 




< < 1, 



Step 2. 



Step 3. 
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verify is that the density is essentially constant pointwise ~ the main result we have achieved in 
the Step 2 above. We also need to identify the support of the equilibrium measure which will be 
done in Appendix F. 

We remark that, alternatively, it is possible to complete the third step along the lines of the 
argument of [26] without using [23] . Using explicit formulae from orthogonal polynomials and the 
pointwise control on the density and on its derivative, it is possible to prove that the local two- 

(2) 

point correlation function pn {x,y) is translation invariant as n — )■ oo. After having established 
the translation invariance of p*^^' , it is easy to derive an equation for its Fourier transform and 
obtain the sine-kernel as the unique solution of this equation. We will not pursue this alternative 
direction in this paper. 



3 Dyson's Brownian motion 

3.1 Ornstein-Uhlenbeck process 

We can generate our matrix H (2.3) from a stochastic process with initial condition H. Consider 
the following matrix valued stochastic differential equation 

dHt = -^d/3, - ^Htdt (3.1) 



where /3( is a hcrmitian matrix-valued stochastic process whose diagonal matrix elements are 
standard real Brownian motions and whose off-diagonal matrix elements are standard complex 
Brownian motions. 

For completeness we describe this matrix valued Ornstein-Uhlenbeck process more precisely. 
The rescaled matrix elements Zij = N^^^hij evolve according to the complex Ornstein-Uhlenbeck 
process 



dzy = d/3.y - -z,jdt, ^, .7 = 1, 2, . . . iV. (3.2) 



1 

For i ^ j , /3 — Pij is a complex Brownian motion with variance one. The real and imaginary parts 
of z ~ X + iy satisfy 

da; = ~ 2^'^^' '^^ ^ ^ ^^^^ 

with /? = "75^'^^ ^ ^^y^ ^^'^ where Px,Py are independent standard real Brownian motions. For 
the diagonal elements i — j in (3.2), Pa is a standard real Brownian motion with variance 1. 

To ensure Zij = Zji , for i < j we choose /3y to be independent complex Brownian motion with 
E [^ = 1, we set /3ji := Pij and we let fin to be a real Brownian motion with E = 1. Then 



{Azik){dzi>j) = [Af3.ik)[A(ijt) = Si^duAt. (3.3) 
We note that dTri?^ = Q, thus 

Tr H^^N (3.4) 

remains constant for all time. 

If the initial condition of (3.1) is distributed according to the law of H, then the solution of 
(3.1) is clearly 
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where ^ is a standard GUE matrix (with matrix elements having variance 1 /N) that is independent 



of H. With the choice of t satisfying (1 



N 3/4+^^ .^^g gg(j ilisx H given in (2.3) has the same law as Ht. 



*) = .s2 = N-^/^+P, i.e. t = -log(l - Ar-3/4+,9) 



3.2 Joint probability distribution of the eigenvalues 

We will now analyze the eigenvalue distribution of Ht- Let \{t) = (Ai(i), A2(t), . . . , Ajv(i)) e 
denote the eigenvalues oi Ht- As t ^ oo, the Ornstein-Uhlenbeck process (3.1) converges to the 
standard GUE. The joint distribution of the GUE eigenvalues is given by the following measure Jl 
on 



pN 



fi = /i(dA) 



-dA, 



n{x) = N 



N 



(3.5) 



The measure fj, has a density with respect to Lebesgue measure given by 



■ exp 



2 ^ ^ 



Aw(A)^ /I(dA) = w(A)dA, 



(3.6) 



where A7v(A) = nj<j('^i ^ '^j)- This is the joint probability distribution of the eigenvalues of the 
standard GUE ensemble normalized in such a way that the matrix elements have variance 1/A'' 
(see, e.g. [25]). With this normalization convention, the bulk of the one point function (density) 
is supported in [—2, 2] and in the N oo limit it converges to the Wigner semicircle law (2.7). 

For any finite time t < oo we will represent the joint probability density of the eigenvalues 
of Ht as ft{\)u{\) , with limt_>.oo /i(A) = 1. In particular, we write the joint distribution of the 
eigenvalues of the initial Wigner matrix H as /o(A)/i(dA) = /o(A)u(A)dA. 

3.3 The generator of Dyson's Brownian motion 

The Ornstein-Uhlenbeck process (3.1) induces a stochastic process for the eigenvalues. 
Let L be the generator given by 



^1 ^ / 1 1 1 

i—1 i—1 \ j^i 



(3.7) 



acting on L'^{fi) and let 



r ^ 1 /■ 

D{f) = - / fLfdJl = E ^ / (9,ffdjl 



(3.8) 



be the corresponding Dirichlet form, where dj = d\ . . Clearly Jl is an invariant measure for the 
dynamics generated by L. 

Let the distribution of the eigenvalues of the Wigner ensemble be given by /o(A)pt(dA). We 
will evolve this distribution by the dynamics given by L: 



dtft = Lft 



(3.9) 
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The corresponding stochastic differential equation for the eigenvalues X{t) is now given by (see, 
e.g. Section 12.1 of [19]) 



dA,: 



1 1 1 

2^'+ TV 2^ A, -A, 



l<i< N, 



(3.10) 



where {Bi : \ < i < iV} is a collection of independent Brownian motions and with initial condition 
A(0) that is distributed according to the probability density /o(A)/I(dA). 

We remark that u{\) and /t(A) are symmetric functions of the variables Aj and u vanishes 
whenever two points coincide. By the level repulsion we also know that /o(A)m(A) vanishes 
whenever Aj = Afc for some j ^ k. We can label the eigenvalues according to their ordering, 
Ai < A2 < . . . < Aat, i.e. one can consider the configuration space 



|a = (Ai, A2, . . . , Aat) : Ai < A2 < . . . < AArj 



c 



(3.11) 



instead of the whole R^. With an initial point in 5^^^ the equation (3.10) has a unique solution 
and the trajectories do not cross each other, i.e. the ordering of eigenvalues is preserved under the 
time evolution and thus the dynamics generated by L can be restricted to S^^^; see, e.g. Section 
12.1 of [19]. The main reason is that near a coalescence point Ai = Xj, i > j, the generator is 



1 



-.91 



1 



A. -A 



1 

2N 



with a = ^(Ai + Xj), b = 5 (A; — Xj). The constant 1 in front of the drift term is critical for the 



Bessel process 



j^db not to reach the boundary point 6 = 0. 
Note that the symmetric density function u{X) defined on can be restricted to S^^^' as 



uiX) = N\ u(A)l(AeSW). 



(3.12) 



The density function of the ordered eigenvalues is thus ft{X)u{X) on S^^^. Throughout this 
paper, with the exception of Section 5.2, we work on the space 'E,^^\ i.e., the equilibrium measure 
/i(dA) ~ u(A)dA with density u{X) and the density function /t(A) will be considered restricted to 



4 Good global configurations 

Several estimates in this paper will rely on the fact that the number of eigenvalues A// in intervals 
/ with length much larger than 1/iV is given by the semicircle law [15]. In this section we define 
the set of good global configurations, i.e. the event that the semicircle law holds on all subintervals 
in addition to a few other typical properties. 
Let 

1 N 

^(dx)^-Y.^('^^^^^ (4.1) 
be the empirical density of the eigenvalues. For an interval / ~ [a, b] we introduce the notation 

A/} = A^[a; b] = N [ uj{dx) 
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for the number of eigenvalues in /. For the interval [E — 77/2, E + ri/2] of length rj and centered at 
E we will also use the notation 

Af,j{E) J\f[E - 7]/2; E + 7]/2] . 

Let 



1 V 

TT + r/ 

be the empirical density smoothed out on scale rj. Furthermore, let 



u^ix) := {9„*iu){x), with 9„{x) = ^ ^2 [ 2 (4-2) 



3= 

be the Stieltjcs transform of the empirical eigenvalue distribution and 



be the Stieljes transform of the semicircle law. The square root here is defined as the analytic 
extension (away from the branch cut [—2, 2]) of the positive square root on large positive numbers. 
Clearly ujy{x) = 7r~^Im m{x + iy) for y > 0. 

We will need an improved version of Theorem 4.1 from [15] that is also applicable near the 
spectral edges. The proof of the following theorem is given in Appendix A. 

Theorem 4.1 Assume that the Wigner matrix ensemble satisfies conditions (2.4)-(2.6) and as- 
sume that y is such that {log N)'^/N <\y\<l. 

(i) For any q > 1 we have 

E\mix + iy)\'^ < Cq (4.4) 
E [u:y{x)Y < Cg (4.5) 

where Cq is independent of x and y. 

(ii) Assume that \x\ < K for some K > Q. Then there exists c > such that 



V {\m{x + iy) - msc{x + iy)\ > 5) < C e-'V^I'^l 12-1:^^11 (4 5) 

for all 6 > small enough and all N large enough (independently of S). Consequently, we have 

E \mix + ty) -Emix + zy)^ < (jv|^||2^' |^||)g/2 + C,1(A^|2/||2 - \x\\ < (logiV)^) (4.7) 
with some q-dependent constant Cq. Moreover, 

C 

\Em{x + iy)~m.,,{x + iy)\ < ^|^|3/2|2 „ |a;||i/2 (^.8) 

for all N large enough (independently of x,y). 

(Hi) Assuming \x\ < K and that N\y\\2 — |a;|| > (logiV)^ we also have 

C 

\Em{x + iy) - m,,{x + iy)\ < —— — . (4.9) 

A/ |y||2 — |a;||'^/^ 
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As a corollary to Theorem 4.1, the semicircle law for the density of states holds locally on very 
short scales. The next proposition can be proved, starting from Theorem 4.1, exactly as Eq. (4.3) 
was shown in [13]. 

Proposition 4.1 Assuming (2.4)-(2.6), for any sufficiently small 5 and for any rf with 



C5'^'^{\ogNf/N <T]* <C- 
(with a sufficiently large constant C) we have 



sup 

£;e[-2+K.,2-K.] 



(4.10) 



We also need an estimate directly on the number of eigenvalues in a certain interval, but this will 
be needed only away from the spectral edge. The following two results estimate the deviation of 
the normalized empirical counting function jfAf[—oo,E] = jjff{\j < E} and its expectation 



m.{E) := -^EN[-oo,E] 
from the distribution function of the semicircle law, defined as 



^yisciE) 



gsc{x)dx. 



(4.11) 



(4.12) 



Proposition 4.2 Assume that the Wigner matrix ensemble satisfies conditions (2.4) -(2.6). Let 
K > be fixed. For any < S < I and \E\ < 2 — k, we have 



J^[-oo,E] 



N 



'yisciE)\ >d} 



>6 \ <C e' 



cS-/N 



with K-dependent constants. Moreover, there exists a constant C > such that 

C 



/oo 
\m{E) - ^sc{E)\dE < 
-oo 



7V6/7 



(4.13) 



(4.14) 



The proof of this proposition will be given in Appendix B. 

Next we define the good global configurations; the idea is that good global configurations are 
configurations for which the semicircle law holds up to scales of the order (log N)^/N (and so that 
some more technical conditions are also satisfied). By Proposition 4.1 and Proposition 4.2, we will 
see that set of these configurations have, asymptotically, a full measure. As a consequence, we will 
be able to neglect all configurations that are not good. 



Let 



2[Nyz 



1, 



(4.15) 



with some small constants < £, 7 < and m = 0,1,2,..., log N . Here [x] denotes the integer 

part of a; e R. Note that within this rang is satisfied if 

e,7 are sufficiently small. Let 



< sup 

_Ee[-2+K/2,2-K/2] 



- QsciE) 



< 



^7/12 I 



(4.16) 
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then we have 

P(f7(™') > l-Ce-'^""^" (4.17) 



with respect to any Wigner ensemble. This gives rise to the following definition. 

Definition 4.1 Let ij*^ ~ 2™-rCN^^ with some small constant 7 > 0, 771 = 0, 1, 2, . . .logiV, and 
let K be a fixed big constant. The event 

0:= fl iv^ ^ sn^Nr,-.{E) < KNrj*}n{M{~K,K) = n} (4.18) 

will be called the set 0/ good global configurations. 

Lemma 4.2 The probability of good global configurations satisfies 

F{n) > 1 - Ce"™^'" (4.19) 

with respect to any Wigner ensemble satisfying the conditions (2.4) and (2.5) 

Proof. The probability of fi^"') was estimated in (4.17). The probability of the seeond event in 
(4.18) can be estimated by (4.13) from Proposition 4.2 and from ^Rsc(O) = 1/2. The third event 
is treated by the large deviation estimate on Afj for any interval / with length |/| > (log A^)^/iV 
(see Theorem 4.6 from [15]; note that there is a small error in the statement of this theorem, since 
the conditions y > (iog N)/N and |/| > (logA^)/A^ should actually be replaced by the stronger 
assumptions y > (log N)'^/N and |/| > (logiV)^/A^ which are used in its proof): 



P{AAr > i^iV|/|} < e^^V-^^l^l. (4.20) 
The fourth event is a large deviation of the largest eigenvalue, see, e.g. Lemma 7.4. in [13]. □ 

In case of good configurations, the location of the eigenvalues are close to their equilibrium 
localition given by the semicircle law. The following lemma contains the precise statement and it 
will be proven in Appendix C. 

Lemma 4.3 Let Ai < A2 < . . . < Xn denote the eigenvalues in increasing order and let k > 0. 
Then on the set Q and if N > No{k), it holds that 

\Xa - m-,\aN-^)\ < Ck-i/^ti--^/^ (4.21) 

for any Nk^/'^ < a < N{1 — k'^/^) (recall the definition of from (4.12)), and 

NgsciXaKh - A„) - (6 - a)| < Ck"'/^ [n'>\b - a|3/4 + N-^\b - a\^] (4.22) 

for any Nk^/^ < a <b < N{1- k^/S) and |5 - a| < CNn^'^/^. 
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4.1 Bound on the level repulsion and potential for good configurations 
Lemma 4.4 On the set f2 and with the choice n given in (4.15), we have 

and 

(l-K^/=)7V 

> E E^^SC,.- ,4.24) 

with respect to any Wigner ensemble satisfying the conditions (2.4) and (2.5) 
Proof. First wc partition the interval [—2 + k, 2 — k] into subintervals 

= [n''iV"i(r- i),n''iV"i(r + i)], r G Z, |r| < ri (2 - K)A^n^'', (4.25) 
that have aheady been used in the proof of Lemma 4.3. On the set f2 we have the bound 

N{Ir) < KN\Ir\ < Cn^ (4.26) 

on the number of eigenvahies in each interval Ir- Moreover, the constraint Nk^I'^ <l< N(\-K'l'^) 
implies, by (4.21), that < 2 — k for sufBciently small k, thus \i £ Ir with |r| < ri. 
We estimate (4.23) as follows: 



A 



l(A^ G /r)l(2'= < N\Xj - X(\ < 2'=+!) 



(4.27) 



^^^1" E EE2"''l{^^^^- 2'=<iV|A, -A,|<2'=+i} 



\r\<ri j<e kel 

where the star in the first summation indicates a restriction to Nn^/"^ 

< .7 < ^ < (1 - K^/^)iV. By 

(4.26), for any fixed r, the summation over i with S Ir contains at most CrC elements. The 
summation over j contains at most Cri* elements if fc < 0, since Xi G Ir and |Aj — A^ | < i^^'^N^^ < 
imply that Xj G Ir U Ir+i- If fc > 0, then the j-summation has at most C(2'° + n"') elements 
since in this case Xj G • \s — r\ < C ■ 2^n~^ + 1}. Thus we can continue the above estimate 

as 



^-^^E E 2-2fcp|3/c/,._iU/,U/,+i : |/| < 2^+i7V-i, AAj > 2} 



^ ^ 2-2fc(„7 4.2^). 



TV 

fc>0 |r|<ri 
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The second sum is bounded by Cn'^"'. In the first sum, we use the level repulsion estimate by 
decomposing /^-i U U Ir+i = Um ^™ intervals of length 2'^'+^iV^^ that overlap at least by 
2''^^N~^, more precisely 



n'^N-\r - 1 - i) + 2'=+^iV-i(m - 1), n''Af-^(r - 1 - 1) + 2''+^N-'^{m + 1) 



wherem = l,2,...,3n^-2-'^-^ Then 



3n"'-2~ 



rn—1 

Using the level repulsion estimate given in Theorem 3.4 of [15] (here the condition (2.5) is used) 
and the fact that Jm C Ir-i U /r U Ir+i C [—2 + k,2 — k] since \r\ < ri, we have 

F{AAj„ > 2} < C(7V| J™|)4 

and thus 

A < ^ J2 E 2-2'=2-'^-i(2'^+2)4 < Cn^f. 

fe= — oo |7-|<ri 

and this completes the proof of (4.23). 

For the proof of (4.24), we note that it is sufficient to bound the event when N\Xj — > 1 
after using (4.23). Inserting the partition (4.25), we get 

l{N\Xe^X,\ > 1) _ 1 ^ ^ 1(A, € /,, A, € L)liN\Xe ^X,\> 1) 



1^., ^ l{N\Xe^X,\>l) 1 V Fl V 



N 7V(A,-A,) N^^^^^^^ 7V(A,-A,) 

<^ V El. , 

- iV ^ Ti-ffls - r| - 11+ + 1 
< CnTlogiV. 

Recalling the choice of n completes the proof of Lemma 4.4. □ 

5 Global entropy 

5.1 Evolution of the entropy 

Recall the definition of the entropy of ffi with respect to 

s,if) ■.^sif^l\^l) = I /(iog/)dA. 

and let ft solve (3.9). Then the evolution of the entropy is given by the equation 
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and thus using that S{ft) > we have 



' D{^^)du<S{fs). (5.1) 

For dynamics with energy H and the convexity condition 

Hess(H) = V^n > A (5.2) 
for some constant A, the following Bakry-Emcry inequality [2] holds: 

(notice the additional N factor due to the in front of the second order term in the generator 
i, see (3.7)). This implies the logarithmic Sobolev inequality that for any probability density 5, 
with respect to /x, 

D{^9) = - I VaLVadfi > ^Sig) (5.3) 
In this case, the Dirichlet form is a decreasing function in time and we thus have for any t > s that 



Di^f,) < (5.4) 



In our setting, we have 



as a matrix inequality away from the singularities (see remark below how to treat the singular set). 
Thus we have 

dtDi^,) < -D{^,) (5.6) 

and by (5.3) 

dtSift) < -Sift) (5.7) 

This tells us that S{ft) in (3.9) is exponential decaying as long as < ^ 1. But for any time i ~ 1 
fixed, the entropy is still the same order as the initial one. Note that t ^ 1 is the case considered 
in Johasson's work [21]. 

Remark 5.1 The proof of (5.5) and the application of the Bakry-Emery condition in (5.6) requires 
further justification. Typically, Bakry-Emery condition is applied for Hamiltonians Ji defined on 
spaces without boundary. Although the Hamiltonian H (3.5) is defined on R^, it is however convex 
only away from any coalescence points = Xj for some i ^ j; the Hessian of the logarithmic terms 
has a Dirac delta singularity with the wrong (negative) sign whenever two particles overlap. In 
accordance with the convention that we work on the space S^^^ throughout the paper, we have 
to consider Ti restricted to S'^\ where it is convex, i.e. (5.5) holds, but we have to verify that 
the Bakry-Emery result still applies. We review the proof of Bakry and Emery and prove that the 
contribution of the boundary term is zero. 
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Recall that the invariant measure exp(— H)dA and the dynamics L — [A — (V'H)V] are 
restricted to S = S'^^. With h = ^f] we have 

dth^ = Lh^ = 2hLh+ ^{\/hf, i.e. dth^ Lh+ ^h-\\/hf. 

Computing dtD{\/Yt), we have 

= ^ X ^^LVh - l^Vh{V^n)Vh + ±-{yh)V[h-\VhY] 



dA 



dihdjh\'^ 



assuming that the boundary term 



-«dA 



(5.8) 



d,h df.h e-^ = (5.9) 



in the integration by parts vanishes. 

To see (5.9), consider a segment A,; = Xi+i of the boundary (9S. From the explicit representation 
(5.11), (5.12) in the next section, we will see that /t > is a meromorphic function in each variable 
in the domain S for any t > 0. It can be represented as by (A^+i — A,;)^F(A) with some /3 S Z, 
where F is analytic and < < cxj near A.^ = Ai+i. Since /t > 0, we obtain that the exponent 
f3 is non-negative and even. Therefore //^^ behaves as [Ai+i — A^p/^ with a non-negative integer 
exponent /3/2 near Xi = A^+i. It then follows that ^i^/f l9^^,^/7e~^ vanishes at the boundary due 
to the factor (A^+i — A^)^ in e~^, i.e. the integral (5.9) indeed vanishes. 

5.2 Bound on the entropy 

Lemma 5.1 Let s = N^^ . For any a > j we have 

SM ■■= S{fsPi\fi) < CN^+^ (5.10) 

with C depending on a. 

Proof. In the proof we consider the probability density u(A) and the equilibrium measure ^ 
extended to (see (3.12)), i.e. the eigenvalues are not ordered. Clearly S{fsn\fi) = S{fsJl\Ji) 
and we estimate the relative entropy of the extended measures. 

Given the density /o(A)/I(dA) of the eigenvalues of the Wigner matrix as an initial distribution, 
the eigenvalue density /s(A) for the matrix evolved under the Dyson's Brownian motion is given 

by 

fsiX)uiX)^ f g,{X,i^)foii^)u{i^)du (5.11) 
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with a kernel 



Ajv(A) 



dot exp 



-NjcXj - t^fc)2 
2(1 -c2) 



(5.12) 



where c = c(s) = e~*/2 for brevity. The derivation of (5.12) foUows very similarly to Johansson's 
presentation of the Harish-Chandra/Itzykson-Zuber formula (see Proposition 1.1 of [21]) with the 
difference that in our case the matrix elements move by the Ornstcin-Uhlcnbeck process (3.1) 
instead of the Brownian motion. 

In particular, formula (5.12) implies that fs is an analytic function for any s > since 



Ajv(A) 



det exp 



-NjcX, - Vkf 
2(1 -c2) 



j.k 



with an explicit analytic function hs{X). Since the determinant is analytic in A. we see that 
/s(A) is mcromorphic in each variables and the only possible poles of /s(A) come from the factors 
(Ai — Aj)~^ in A7v(A) near the coalescence points. But /s(A) is a non-negative function, so it 
cannot have a singularity of order —1, thus these singular factors cancel out from a factor (A^ — Aj) 
from the integral. Alternatively, using the Laplace expansion the determinant, one can explicitly 
sec that each 2 by 2 subdeterminant from the i-th and j-th columns carry a factor ±(Ai — Aj). 

Then, by Jensen inequality from (5.11) and from the fact that fo{i/)u(v) is a probability density, 
we have 



gs(A,iv) 



u{X) 



SMs)^ I fs{\ogfsW< ff log 
Expanding this last expression we find, after an exact cancellation of the term {N/2) log(27r) 



logdct I exp 



y logiV - ^^^^ log c - y log(l - c^) + log Ajv(A) - log AnM 



2(1 -c2) 



Ar2 



■ logiV 



3,k 



N 



N 



N 



-Y^Xl - 21og Ajv(A) + ^ log J-! ) g,(A, v) f^{u)u{u)AXAu . 



Since s = A^"\ we have logc -l/2iV and log(l - c^) = -logiV + 0{N-^). Hence 



R"xR" 



CN log iV + log Aat ( A) - log Aat (iv) + log det exp 



2(1 -c2) 



j.k 



N ^ ^1 
logTV + - ^ A? - 2 log AAr(A) + J2 log 5s(A, iv)/o(iv)2(iy)dAdz/. (5.13) 

i=l 2=1 J 

For the determinant term, we use that each entry is at most one, thus 

--N{cX,-iyk)^' 



log det exp 



2(1 -c2) 



< log TV!. 



3,k 



19 



The last term in (5.13) can be estimated using Stirling's formula and Riemann integration 



JV N 



5^ log j! < ^ log K j +C log(27ri) 
j=i j=i \ 

rN+l N 

< 



dx X log X — ^ j + CN log N 

.7 = 1 



(5.14) 

~ 2 4 

thus the iA^^logA^ terms cancel. For the terms we need the following approximation 

Lemma 5.2 With respect to any Wigner ensemble whose single-site distribution satisfies (2.4)- 
(2.6) and for any a > 1/4 we have 

N 
i=l 

where the constant in the error term depends on a and on the constants in (2.4) -(2.6). 

Note that (2.6), (2.5) hold for both the initial Wigner ensemble with density /q and for the evolved 
one with density ft. These conditions ensure that Theorem 3.5 of [15] is applicable. 

Proof of Lemma 5.2. The quadratic term can be computed explicitly using (3.4): 

^E^A2 = ^ETri?2 = \n^^^ ( x^Qsc{x)dx, (5.16) 

The second (determinant) term will be approximated in the following lemma whose proof is post- 
poned to Appendix D. 

Lemma 5.3 With respect to any Wigner ensemble whose single-site distribution satisfies (2.4)- 
(2.6) and for any a > 1/4 we have 

ElogAA,(A) = ^ j j \og\x -y\gsc{x)gsc{y)dxdy + 0{N^+''). (5.17) 
Finally, explicit calculation then shows that 

^ j x^Qsc{x) ^^-^ J J ^og\x ~ y\ gsc{x)gsc{y) dx dy = ^, 
and this proves Lemma 5.2. □ 

Hence, continuing the estimate (5.13), we have the bound 

SM < CN'+"+ ff {logAN{X)-\ogAN{i^)}gs{X,u)fo{u)u{u)di^dX 
N ^ 

< CiVi+" + — E^[Aj2(s)-A2(0)] =CiVi+", (5.18) 
j=i 

where we used Lemma 5.2 both for the initial Wigner measure and for the evolved one and finally 
we used that the ETyH^ is preserved, see (3.4). This completes the proof of (5.10). □ 
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6 Local equilibrium 

6.1 External and internal points 

Choose t = tN^^ with some r > 2. Thus from (5.4) with s = N^^, we have 

D{^,) < 25(/jv-i)t-i < CTV^+V-i (6.1) 

by using (5.10). Recah that the eigenvalues are ordered, Ai < A2 < . . . < Xn- Let L < N — n {n 
was defined in (4.15)) and define 

H-lW '■— {Al+1, • • ■ Al+„} 

and 

ni(A) :={Ai,A2,...AAr}\ni(A) 

its complement. For convenience, we will relabel the elements of 11^ as x = {xi, X2, . . . x„} in 
increasing order. The elements of 11^ will be denoted by 

ni(A) := y = {y-L,y-L+i, ■ ■ ■y-i,yi,y2, ■ ■ - yN-L-n) e 

again in increasing order (S was defined in (3.11)). We set 

Jl ■■= {-L,-L+l,...,-l,l,2,...N ~ L-n} (6.2) 

to be the index set of the y's. We will refer to the y's as external points and to the Xj's as 
internal points. Note that the indices are chosen such that for any j we have < Xj for /c < 
and yk > Xj for k > 0. In particular, for any fixed L, we can split any y G S*^^"") as y = (y-, y+) 
where 

y- (y-L, y-L+i, • • ■ y~i), y+ -.= {yi,y2, ■ ■ ■ yN-L-n) 

The set with a splitting mark after the L-th coordinate will be denoted by S^^""-* and we 

use the y S <;=^ (y-,y+) £ one-to-one correspondance. 

For a fixed L we will often consider the expectation of functions 0(y) on with respect 

to fj, or //x; this will always mean the marginal probability: 

E^O := y 0(y)u(y-,a;i,a;2, . . .a;„,y+)dydx, y = (y-,y+)- (6.3) 

E/0:= y"0(y)(/M)(y_,xi,X2,...a;„,y+)dydx. (6.4) 
For a fixed L < - 71 and y G S^^^") let 

-1 

(6.5) 

be the conditional density of x given y with respect to the conditional equilibrium measure 
/Xy(dx) = Aiy(dx) = My(x)dx, Uy{x) := u(y,x) 



/y^(x) = /y(x) = /t(y,x) 



/t(y,x)^y(dx) 



u(y, x)dx 



(6.6) 
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Here fy also depends on time t, but we will omit this dependence in the notation. Note that for 
any fixed y G any value xj lies in the interval ly := i.e. the functions Uy(x) and 

/y(x) are supported on the set 

S^"' := |x = {xi,X2, . ■ ■,Xn) ■■ y-i < xi < X2 < ■ ■ ■ < Xn < yij C ly. 

Now we localize the good set introduced in Definition 4.1. For any fixed L and y = (y_ , y+) £ 
^(jv-ri) define 



{IIl{X) : Ae r!,ni(A) =y} = {x= (xi,X2,...,x„) : (y_, x, y+) e O}. 



Set 
Since 

from (4.19) we have 



fii = n,iL) {y e S^^'-"^ : P/,(r!y) > 1 - Ce-""""}. 
P(r!)=P/P/^(17y), 



(6.7) 



(6.8) 



P/(17i) > l-Ce"" . 
Here P/(i7i) is a short- hand notation for the marginal expectation, i.e. 

but we will neglect this distinction. 

Note that y € rii also implies, for large N, that there exists an x S ly such that (y_ , x, y+) G fl. 
This ensures that those properties of A G that are determined only by y's, will be inherited to 
the y's. E.g. y E ili will guarantee that the local density of y's is close to the semicircle law on 
each interval away from ly. More precisely, note that for any interval / = [E — E + 77,*j/2] 

of length 7]*^ = 2™--n?N~^ and center E, < 2 — k, that is disjoint from /y, we have, by (4.16), 



y€l7i, /n/y 



Af{I) 



N\I\ 



- Qsc{E) 



< 



(iV|/|)V4 



,7/12 



Moreover, for any interval / with |/| > n'^N ^ we have, by (4.18), 

yeOi, /n/y=0 =^ Af{I)<KN\I\. 
For any L with Nk^^^ <L<N{1- k^/^), let El = mj^^LN-^), i.e. 



N 



gsc{X)dX = L. 



Then we have 



-2 + Ck<El<2-Ck, 



gsc{EL) > CK 



.1/2 



(6.9) 



(6.10) 



(6.11) 



(6.12) 



Using (4.21) and (4.22) from Lemma 4.3 on the set (see (4.18)), we for any y G ^i{L) we have 



.l-a^,-l(L^-l)|<Cn^''/^ |/y|- 



NgsciEL) 



(6.13) 
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in particular 

I2/-1U2/1I <2-Ac/2 and l^yl < ^ (6.14) 

with C = C(k). 
Let 

= ^^{V) = {(y-,y+) e : < KnN-^'] (6.15) 

with some large constant K. On the set we have |/y | < Kn/N (see (6.14)), thus n^(r2) C Vl2{L), 
i.e. 

P/(f^2) > 1-Ce-"^'". (6.16) 

6.2 Localization of the Dirichlet form 

For any L < N ~ n and any y e , we define the Dirichlet form 



DlAI) ■■= J ^(Vx/)'d^^(x) 



for functions / ^ /(x) defined on .^y"^ Hence from (6.1) we have the inequality 



Ef.DL^yUfyi^)) < CnN-^D{^^) < CN^+'^nr-' (6.17) 



iV(l-2K3/2) 

where the expectation E/^ is defined similarly to (6.4), with / replaced by ft- In the first inequality 
in (6.17), we used the fact that, by (6.5) and (6.6), 



= j dxdy /t(y,x)M(y,x) i'i.y (y^/y (x)) 

dxdy/.(y,x).(y,x) yj dx /dx7.(y,xO.(y,xO ^^^'^) 

^^X:/dxdy^4(^.(y,x) 
SiV^^y /t(y,x) ^ 

and therefore, when we sum over all L e {Nk^/"^, . . . , iV(l — k'^/^)} as on the l.h.s. of (6.17), every 
local Dirichlet form is summed over at most n times, so we get the total Dirichlet form with a 
multiplicity at most n. 
We define the set 



^1 = [nk^I^ <L<N{1~ ^3/2) : Ef^DL^^J fy{^)) < CN^+^'n^-^}, (6.18) 



then the above inequality guarantees that for the cardinality of Q 



I?^^>1_£. (6.19) 



7V(1- 2^3/2) - n' 
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For L e t/i, we define 

n3 = n3iL) :={(y_,y+)GS 



(7V-ri) 
L 



then 



(6.20) 
(6.21) 



6.3 Local entropy bound 

Suppose that L E Qi and fix it. For any y G denote by 



Hy(x)=7V 

Note that 



" 1 2 2 " 



l<i<^'<n 



Hcss-Hy(x) > inf ^ 



2; - yfcl 



(6.22) 
(6.23) 



ke.iL 



for any x € /y as a matrix inequality. On the set y G r22(-^) we have 

inf ^ |x - yu\-^ > >—, ye n,{L). 

^^ly \yi-y-iV " 

We can apply the logarithmic Sobolev inequality (5.3) to the local measure fiy, taking into account 
Remark 5.1. Thus we have 



V(/y) < c-'n'N-'DL^y{J fy{^)) < Cu^N'^t'' for any y G Vl^iL) n l)3(i), L e g^. (6.24) 



Using the inequality 



for jjL = pLy and f ~ fy, we have also have 



/ l/y-l|dA^y 



< Cn^N'^T-^ for any y e f^2(i) onsiL), Le Gi 



We will choose t = N with r = such that 
i.e. ^ > lOe + a. 

6.4 Good external configurations 
Definition 6.1 The set o/good L-indices is defined by 



(6.25) 



(6.26) 



(6.27) 



■^rj, ^1 [^(Aj- Al+„+i)] 
lo 

-n+1 — Aj I 



< 



(6.28) 
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Lemma 4.4 together with (6.19) imply that 



N{1 - 2k3/2) - n7 
Notice that for any fixed L we can write 

L+n 



(6.29) 



I N \: - \ t\ ^ ly 



j=L+ 
L+n 



lE/E/, y 



^[iV(a;,-y_i)]2 



and we also have 



(f^y)>-% y 4^1^. 



and similar formulae hold when Al is replaced with Xl+u+i and y_i with j/i. 

We also want to ensure that the density on scale rj := t/q — n'^N^^ is close to the semicircle 
law. Let 

OEix) - l{\x^E\<7]/2) 

be the characteristic function of the interval [E — ri/2, E + r//2]. Consider fi'"' defined in (4.16), 
then n C ^2^°) and (4.19) imply that 



AT 



E/lfi sup ^yOE{Xi)-Qsc{E) < (iV,y)-i/4„^/i2 = 71-^/6 



|_E|<2-k/2 



Fix L IE G, consider y G and define 

/; [2y_i+7?/2,yi-77/2] 

so that if E € I* then [ii^ — r]/2,E + 7j/2] C ly. Moreover, on the set fl we know that ly C 
[-2 + k/2,2- k/2] (see (6.14)). Therefore 



/E/ylOy sup 71 '^E^£^(-^'') - £'.sc(£') 



< E/lf2 sup 

|_E|<2-k/2 



^Eo^(A.)-^...(i?) 



This gives rise to the following definition: 
Definition 6.2 Let L E Q. The set of good external points is given by 

1 



< 71"^/^ 

(6.30) 



:=f7in02n03n|y=(y_,y+)eSf-") : y y 



\Niy±i-yk)\ 



(6.31) 



" 1 
E/^ln, sup n-'yilNlx, -E\< -nA - Qsc{E) 



Eel 



y 4=1 



< n 



-7/12 
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It follows from (6.8), (6.16), (6.21), (6.28) and (6.30) that 



(6.32) 



6.5 Bounds in equilibrium 

In this section we translate the bounds in the second and third lines of (6.31) into similar bounds 
with respect to equilibrium using that the control on the local Dirichlet form also controls the local 
entropy for the good indices: 

Lemma 6.1 Let A > be arbitrary and y e y^. If t > n'^^+^N°', i.e. P > {AA + 8)e + a, then 
for p = 1,2 we have 



[N\xj-y±i\]P 



Moreover, we also have 



Proof Let O : M" - 
we have 



sup 



Be/; 



(6.33) 



(6.34) 



be any observable and fly be any event. Then for any fixed y G S*^^ 



My- 



1^2yO(/y - l)dMy <||0||^ /|/y-l|dMy <C\\0\\t,S^Jfy) 



by the entropy inequality (6.25). li L E Q and y e ft-ziL), then we have by (6.26) that 
For a given y G 3^^, we set the observable 



(6.35) 



with llOlloo < Cn-^P+i < cn^^+i. Then, for r > n^^+^iV" we obtain from (6.31) and (6.35) that 

On the complement set fl^ we just use the crude supremum bound together with the bound on 
P/^(f}^) in the definition of fli (6.7): 



IE.. EE 



l^.l{N\x,-y±i\>n-^) 
[N\xj-y±i\Y 



Combining the last two estimates proves (6.33). 

The proof of (6.34) is analogous, here we use that the corresponding observable has an L° 
bound 



< n 



1-7 



This completes the proof of Lemma 6.1. □ 
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7 Cutoff Estimates 



In this section, we cutoff the interaction with the far away particles. We fix a good index L Cz G 
and a good external point configuration y G yj^. Consider the measure /iy = e^^^/Zy with 



^y(x) = N 



Y,xjl2~2N-^ log|x,-x,|-2iV-i^log 



Xi - yk\ 



The measure fiy is supported on the interval Iy = 1,2/1). 
For any fixed y, decompose 

n 

Hy= 7^1+^2, 7{2(x)-^F2( 



where 



and 



with 



N 



|fe|>ri^ 



Hl(x) = -2 J2 l0g\xJ-X^\-YVlix^) 

l<i<j<n 



i=l 



|fc|<ri« 

where _B is a large positive number with Be < 1/2. We define the measure 

MW(dx) :=^— ^. 

^1 

Lemma 7.1 Lef L E Q and y G 3^l. For _B > 20, we have 



sup 



d/iy 



■(x)-l 



(7.1) 



(7.2) 
(7.3) 
(7.4) 



(7.5) 



(7.6) 



This lemma will imply that one can cutoff all y^'s in the potential with |fc| > . 
Proof. Let 

SV2 max V2 — min V2, 

then, by (6.15) and y g y^, we have 

SV2 < |/y|||Vg|U < CTiiV-iliy^'ll^ 
In Lemma 7.2 we will give an upper bound on ||V2'||oo, and then we have, for B > 20, that 

SV2 < Cn-''/^+\ 

Since 



d/iy 

we obtain (7.6). □ 



(x)-l 



< CnSV. < 
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Lemma 7.2 For B > 20 and for any L e Gi, y G we have 



sup 1^2(2;) I ^ sup 



Nx 



\k\>r 



x-yk 



(7.7) 



Proof. Recall that y G C ili implies that the density of the y's is close the semicircle law 
in the sense of (6.9). Let 

(7.8) 



NQsc{y-i) 

Since y € fii, we know that |y-i|, |yi| < 2 — k/2 (see (6.14)), thus Qsciv-i) > c > 0. Taking the 
imaginary part of (4.3) for \z\ < 2 and renaming the variables, we have the identity 



X = 2 



Furthermore, with y = h{y-i + yi) we have 



Qscjy) 
x-y 



dy. 



Qsc{y) 



\y-v\<d X-y 



dy 



< Cd 



since y is away from the spectral edge thus Qg^ is continuously differentiable on the interval of 
integration [y — d,ij + d]. Thus 



Nx-2N 



Qsc{y) 



l\v-y\>d X-y 
therefore to prove (7.7) it is sufficient to show that 



dy 



< CNd < Cn^ 



sup 



- E — 



\k 



|>„B ^-Vk J\v-v\>d = 



Qsciy) 



X-y 



dy 



(7.9) 



We will consider only k > and compare the sum with the integral on the regime y > y + d, the 
sum for k < —n^ is similar. 
Define dyadic intervals 

/„, = [y + 2"Xy + 2'"+V], m = 0,l,2,...,log^ 

Since y £ C r^i, i.e. max|?/fe| < there will be no yt above the last interval I\ogN- We 
subdivide each /,„ into n^/"^ equal disjoint subintervals of length 2"^dn~^/^ 

/m = U Im,(., Im.i = [y*m,t-i,y*m,i] with y^^^^ := yi + 2™<i(l + fe"-^/^). 

£=1 

For y e C fii, the estimate (4.22) holds for yi and y„B, i.e. 



NQsc{yi){ynB - yi) - {n^ - l) 
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if S > 20, which means that 

\y^B-{yi+d)\ < ^ +^ 
(using Be < 1/2, < N^'"^), i.e. 



1 



Qsc{y-i) Qscivi) 



C7n4S/5 

< — + < — (7.10) 



TV 



iV2 



|y„B - {y + d)\ < 



N 



(7.11) 



by using the definition of d from (7.8), the fact that gsc(y±i) is separated away from zero and that 
|Jy| < CnN-'^ from (6.14). 
Therefore we can estimate 



logJV 



,B/2 



l(2/j e l7n,£) 



- y — — - F F F 

N ^ X - Vk N ^ ^ X- yi 



^ 1 V- 1{] <n^,yj>y + d) 1 V- l(j > % < y + rf) (^-l^) 
- N 2^ \x-yj\ N 2^ 



N ■ 7 



\x-y3\ 



To see the last estimate, we notice that in the first summand we have y + d < yj < y„B < 
y + d + Cn'^^/^N~^ by (7.11), i.e. aU these y/s he in an interval of length Cn''^/^iV"\ so their 
number is bounded by Cn^^^^ by (6.10). Thus the first term in the right hand side of (7.12) is 
bounded by Cn'^^^^N~-^d~'^ < Cn^~^^^; the estimate of the second term is similar. 
Using that 



1 



1 



yeIrr^,i X — y yei^^e x - y 



1 2"'dn-^^^ 
< \Imj \ max max < C — — — rrr^ — < 



C 



xeiy yeir^.e {x - yY 



(2"d)2 



2™dnS/2 



we have 



\N ^ ^ ^ x-y, 
m=o e=i jeJL •' 



N 2-^ 2^ 2^ -r — 1/ N 2^ 2-^ X — V* , 



m=0 1=1 
m=0 £=1 



(7.13) 



In the second line we used that N{Im,t) < KN\Imi\ by (6.10) since y € fii and Im,£ H ly 
We use that for y e ili we can apply (6.9) for / = and we get 



m>0 £=1 



a; - y,n,£ 



EE 

m=0 £=1 



-EE ^^^"^'^ ^ N\I„,M,{y*^^,) 



a; - ym,£ 



Cni/^^ 1^'^' (2™n'^/2)3/4 (7.14) 



< 



EE 



m=0 £=1 
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where we used that |/™,^| = 2™dn--^/2 < C-2™n^/2iV-i (see (7.8)) and that \x-y*^J > 2™-^ > 

Finally, the second term on the left hand side of (7.14) is a Riemann sum of the integral in 
(7.9) with an error 

(7.15) 

since on each interval Irn,i we could estimate the derivative of the integrand as 

sup -; < C 5- 

ye/^, dy x-y " V2'"nS/ 

Combining (7.12), (7.13), (7.14) and (7.15), we have proved (7.9) which completes the proof of 
Lemma 7.2. □ 



8 Derivative Estimate of Orthogonal Polynomials 



In the next few sections, we will prove the boundedness and small distance regularity of the density. 
Our proof follows the approach of [26] (cf: Lemma 3.3 and 3.4 in [26]), but the estimates are done 
in a different way due to the singularity of the potential. For the rest of this paper, it is convenient 
to rescale the local equilibrium measure to the interval [—1, 1] as we now explain. 

Suppose L Cz Q and y G 3^l- We change variables by introducing the transformation 



r 1 11 ~ rr.^ ^ ^{w-y) - y-i+yi 
[—1, IJ, w = 1 (w) := — 7— — , with y := 



and its inverse 



then T{Iy) 



w = T 



y 



-1, 1]. Let fly be the measure ^y^"* (see (7.5)) rescaled to the interval [—1, 1], i.e 
1 



Aiy(dx) 



■ exp 



J2Uy{x,) + 2 J2 log I 

l<~i<j<~n 



i=l 



dx 



on [-1,1]" with 



2 ^ 

Uy{x) := ^og\x-yk\. 



|fc|<«s 

The £-point correlation functions of fiy and Jly are related by 



p'n\xi,X2,..-Xn) =piP[y + 



pi^\xi,X2,-.-Xn)- 



(8.1) 



i.2) 



(8.3) 



Let Pj{X), j = 0, 1, . . . denote the real orthonormal polynomials on [—1,1] corresponding to the 
weight function e""^^''''', i.e. deg pj = j and 

p,(A)pfe(A)e-"^?(^)dA = 5,fc 
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and define 

^,(A):=p,(A)e-"^^(^)/2 (8.4) 

to be orthonormal functions with respect to the Lebesgue measure on [—1,1]. Everything depends 
on y, but y is fixed in this section and we will omit this dependence from the notation. 
We define the n-th reproducing kernel 

n-l 

if„(A,i.) = ^Vj(A)^,H (8.5) 

3=0 

that satisfies ^ 

KniX, j ^ Kn{X, OKniC, i^)dC- (8.6) 

The density is given by 

g„{X)=li'\X) = n-'K„{X,X) (8.7) 
and the general ^-point correlation function is given by 

pI^HX^, A2, . . . , A,) = i!i^det{i^„(A,, Afc)}^^,,^i (8.8) 

following the standard identities in orthogonal polynomials. For the rest of the paper we drop 
the tilde and all variables will denote the rescaled ones, i.e. all x variables will be on the interval 
[— 1, 1]. All integrals in this section are understood on [—1, 1]. 

The basic ingredients of the approach [26] can be described as follows: Suppose that the 
following two properties hold for the normalized function ■0 = i/ij, j — n — 1, n, and for some fixed 
K > 

f \ij'{x)\^dx < Cn''+' (8.9) 

J\x\<1~k/2 

I ^p^{x)dx <Cn'^, jxol < 1 - k (8.10) 

for some positive cr, d, e with cr < 1. We will take take 6 = 1/4, same as in [26]. Let 

^ = 1- 1 tl;{x)dx 

J\x-xo\<£ 

be the average of ip in the interval |x — xol < i with some xq, \xo\ < 1 — k and i < k/2. We have 

Using (8.10) to estimate \^\ < Cl~^^'^rS'^^^^/'^ (under the assumption that £ < -nT^) and using 
(8.9), we obtain 

Choosing I = ri^i+(''~*~^)/2 we have 

|V'(a;o)| < + (8.11) 

Note that |^(a;o)| = 0{n^~' ) with some e' > provided that cr + £ < ^. Suppose we can also 
prove that 

\Q'{x)\<Cn'"{^Pl_,{x)+^l{x)) (8.12) 
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with some small power e", then it will follow that |£i'(a;)| < o(n) and this proves the regularity of the 
density over a distance of order 1/n. Together with the fact that the density is well approximated 
with the semicircle law on scales bigger than 1/n this will show that the density is close to the 
semicirle law pointwise. In [26] the regularity of the density on larger scales followed from the 
smoothness of the potential (Theorem 2.2 of [26]). In our case this follows from (6.34) which is 
a consequence of the fact [15] that the semicircle law is precise on scales slightly larger than 1/A^ 
that corresponds to scales bigger than 1/n after rescaling. 

In proving (8.9), (8.10) and (8.12), one basic assumption in [26] requires the potential to be 
in C"^^" for some > 0. The potential for our probability measure (8.2), parametrized by the 
boundary conditions y, is singular near the boundary points {±1}. In order to control these 
singularities, besides using some special properties of orthogonal polynomials, we rely on [15] via 
(6.33) to provide essential estimates such as level repulsions. It turns out that we can only establish 
(8.9) and (8.10) for i^jjj < n— 1 following this idea. The case of j = n has to be treated completely 
differently. We now start to prove (8.9) for ipj,j = n — l,n—2. 

Lemma 8.1 Suppose that L G CJ, y G and, after rescaling that sets y^i = —1, j/i = 1, let the 
y- configuration satisfy 

N<ii<it^„. l^-ycl i<|^„. ^\yk-yi\ \yk-y-i\i 

(note that the boundary terms fc = ±1 are not included in the summations). Furthermore, assume 
that the density p„ satisfies 

[(.T + l)-2 + (1 - x)-'^]gn{x)dx < Cn^'^ (8.14) 
for some A > 60B. Then for the orthonormal functions ^jj from (8.4) we have 



1 

2 



1 



1 ^ ^_ 



3 
2 



dx<n^'' j<n-l. (8.15) 



1 

{ij'Jx)fdx < Cn^+^'^ i <n-\. (8.16) 



1 



Notice that the assumptions (8.13) and (8.14) follow from (6.31) and (6.33). 

In this section and in the subsequent Sections 9 and 10 we work with orthogonal polynomials 
on [—1,1] with respect to the potential Uy{x) (see (8.2)). For brevity, we set V{x) = Uy{x) in 
these three sections and we make the convention that the summation over the index k that labels 
the elements of the external configuration y will always run over integers with for 1 < < 
unless otherwise indicated. 

Proof. For simplicity, let p(x) = pj{x) and il}(x) = ipj{x). Then 



/. 



1 pi pi 

(p'(x))2e-"^(=^)da; = / -p" {x)p{x)e-''^^''Ux + n p' {x)pix)V' {x)e-''^'-^^dx. 
1 J-i J-i 
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Note that e~"^'^^ is zero at the boundary x — ±1 so the boundary term vanishes in the integration 
by parts. Since p{x) is an orthogonal polynomial, it is orthogonal to all polynomials of lower degree, 
thus the first integral vanishes. By Scliwarz inequality, the second integral is bounded by 

n /%'(a;)p(x)V^'(x)e-"^("Mx < i C {p' {x)fe-''^'^^'^dx + \ [ p\x){nV'{x)fe~''^^''^dx 



We have thus proved that 



The last integral is bounded by 



\x){nV'{x)fe-''^^'''>dx < h + h 



with 



1 



1 



(a; -1)2 (a- + 1)2 



k^±l 



x-yk 



ijP'{x)dx. 



From (8.13), and the normalization of t/i we have 



.17) 



(8.18) 



(8.19) 



(8.20) 



To control the term Ii, we separate the integration regimes |a:± 1| < and —1 + n^^ l£ x < 
1 — n^^ for some big constant A. In the inside regime, we can use |'(/'(a;)P = |V'j(a:)P < ngn{x) 
since j < n ~ 1. From (8.14) we obtain 



(a;-l)2 (x + l)2 



t(;'^{x)dx < Cn^+^'^. 



.21) 



To estimate the singular part of the integral in Ii near the boundary points, we can focus in 
estimating 

^+""" V'(x) 

-1 



the other cndpoint being similar. Let 



9{x) = 



[l + xY 
x + l 



dx 



Notice that g{x) is a polynomial of degree deg g < 2n?^ + n. From the Nikolskii inequality (see, 
e.g.. Theorem A.4.4 of [5]) 



.g|l4<C(degg)^-^|l5lli/4<Cn^'^|1.9|li/4 



.22) 



with some imiversal constant C. Here \\g\\p is defined as {j^i \9{x)Y'dxY ^ for any < p < 00. 
Notice that Nikolskii inequality holds between spaces even with exponents p <\. By the Holder 
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inequality, 



!i.9ii!/4 = [J\ lai^r'dx^' < ( J\ \9{^)\'^'\^ + ir^^dx) ( \x + ir/'dx 

1/4 

<C[ I \gix)\^ix + l)^dx 

= c\m/' = a 

Thus from (8.22) wc have j|gj|4 < Cn^^^ and by Holder inequality we have 

/'^""I da; < Cn-''/^\\g\\l < Cn^"^'^/^ < C (8.23) 
L (2; + !)^ 

provided A > QOB. Together with (8.21), this proves Ii < Cv}^^"^ . Combining this with (8.20) we 
obtain a bound Cn^+^^ for (8.18) which proves (8.15). 
Using this estimate and (8.17) we obtain that 

\p'{x)\\-^^^''^dx<Cn'+^\ 

J-i 
Since 

|V''(x)|2 < C[p'{xf +p'{x){nV'{x)f]e-"^^^\ 

we have thus proved that 

1 .1 

W{x)\^dx < Cr?+^^ + C p\x){nV'{x)fe-"^^'''>dx < Cr?+^'^ (8.24) 
1 J -I 

by using (8.18). This completes the proof. □ 

9 Bound on smeared-out orthogonal polynomials 

Lemma 9.1 Let k,Sq > be arbitrary positive numbers. Let L £ Q , y ^ y^, suppose that the 
y- configuration satisfies (8.13), (8.14) and the density Qn{x) > (5o > for all |a;| < 1 — k. Let 
■0 = '0n-i or ipn-2 be an orthogonal function. Then we have 



n 



/ i;^{x)dx <Cn^'^, |xo|<1-k (9.1) 



with a constant C depending on k and 5q. 

Proof. For any z = u + irj Cz C with 77 > 0, let 



/ X / Qnix), 

m„(z) = / dx 



denote the Stieltjes transform of the density and denote by 

\2 



Gn{x,y) =p}^>{x,y) - g„{x)gn{y) = -y H : 

n[n — 1) n — 1 



(9.2) 
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the truncated correlation function, where pn was defined in (8.3) and computed from 
will again drop the tilde in this proof. 
We have the identity 



We 



V'{x)Qn{x) n-1 2/ X 1 

dx = m^{z) - - 



X — z 



n J (x — z) 



Gnix,y) 
{x- z){y- z) 



dxdy. (9.3) 



This identity follows from expressing p„ by an integral over n — 1 variables of the equilibrium 
measure and then integrating by parts (see also (2.81) of [26]). Hence, by using (8.6), we have 



mliz) 



V'{x)g„ix) 



dx 



1 

2^ 



Kl{x,v) 



x — z y — z 



dxdy. 



(9.4) 



The last integral can be bounded by 



X — z y — z 



dxdy 



< 



Kl{x, y)- — rirdxdy 

[x — zj (y — zy 



<7?'4 / Kl{x,y){x ~ yfdxdy < Cr]- 



where, to estimate the last integral, we have used the Christoffel-Darboux formula 



, X , ■>pnix)-)pn^i{y) - 'lp„{y)'>Pn~l{x) 
J<n(X,y) = Jn-1 7^ , Jn-l 



We have thus proved that 

ml{z) 



x-y 



V'{x)gn{x) 



xij:n-i{x)'ipnix)dx. (9.5) 



X — z 



dx = 0{n rj ), z = u + ir] 



(9.6) 



We define a new measure fi on [—1, 1]" ^ as 



Tl-l 



My (da:i, . . . ,da;„_i) = — 



■ exp 



-nJ2vix,) + 2 J2 log 

i=l l<i<j<n-l 



where we already omitted the tildes and recall that V{x) = Uy{x). Note that this measure differs 
from (8.1) written in n — 1 variables in that we kept the prefactor n in front of the potential. Define 



Qn {x) 



71-1 



J fiy{x,dx2,dx3, . . .da;„_i) 



and note that 



n-2 



3=0 

where ipj^s are defined in (8.4). This latter formula follows from the recursive relation of the 
correlation functions for GUE-like ensembles, therefore 



^l-iix) ^ n{gnix) - g„ix)). 
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Let 



dx 



m„ (z) 

be the Stieltjes transform of g^; then we have the analogue of (9.6) 



-dx = 0{n-^Tj-''). 



Subtracting this from (9.6), we have 



r(x)VLi(x) 



d.T + 0(n-^77-*) 



(9.7) 



Assume that u = Rez satisfies \u — xo| < n By adding n(m„(z) — to„ {z))V'{u) to the both 
sides of (9.7), we obtain 



n{m„{z) - TO„ (z))(m„(z) + m„ (z) + V (u)) = 



(r(x)-r(u))V4-i(x) 



dx + 0(^-^77-'') 



We divide the integral into \x — xo\ < and ja; — xo| > iy/2. In the first integration regime, since 
l^^ol < 1 — J^, we have 



i 



\x-xo\<u/2 



V'{x) - V'{u) 



X — z 



V'^_i(x)d.T 



1 v-^ 1 

\x-xo\<<y/2 n \X-U\ 



1 



1 



Uk ~ X Uk-u 



(9.8) 



\x-xa\<y/2 



iljl_i{x)dx. 



Since |x| < l — \u\ < 1 — 1//2, we have \yk — u\ >2v^ for any k. Thus, by (8.13), the prcfactor 
in (9.8) is bounded, uniformly in|x|<l — i^/2, by 



-y— 

ri ^ \x — 

C 



n — ' \x — u\ 

k 



1 



1 



< 



Vk- X Uk -u 
1 



< 



n ^ — ' 



, -2 <Ay^ 

n ^ \yk - x\\yk - u\ m ^ \yk - x\ 



E 



1 



E 



1 



(9.9) 



where the constant C depends on v and wc recall that y-i = — 1, yi = 1 in the rescaled variables. 
In the second integration regime we use |x — 7i| > jx — a;o| — |a;o — m| > 1^/4 and obtain 



/ 

J\x 



\x-xo\>u/2 



V'{x) - V'{u) 



ipl_iix)dx 



C 
< — 

n 



\x-xo\>v/2 J, 



1 



■)pl_i{x)dx 



^ n ,1^ 



1 



Vk-x 



Vk-x yk - u 
^l_Ax)dx + -Y - — i — : < Cn^T 



where we have used (8.15) and Holder inequality to estimate the first term in the last line and 
using (8.13) for the second term. 
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We have thus proved that 

n |(to„(z) - m,7(z))(m„(z) + m~(z) + V'{u))\ < rC'^ + Cn"^??""'. 

Hence 

n m„(z)-m„(z) < — 

using that lmm~{z) > 0. Since Qnix) > (5o > by assumption, Imm„(z) is bounded from below. 
Thus, choosing rj = 7i~^/^, we obtain 



i'i-iix) 



dx 



X — z 

with C depending on v and (5o. Taking imaginary part, we have 

7] 



( y2^ ^ijl_,{x)dx<Cn^\ 
(X — uy + ry^ 

for any u with |u — xo| < i] = n^^/"*. Integrating over \u — xo\ < rj and using 



/ 7 2 - l(l^~^o| < J?) 



'\u-xa\<r, {x~uY +7]- 

with some positive constant c, we have proved (9.1) for ip = ipn-i- The can be 

done in a similar way. This completes the proof of Lemma 9.1. □ 

Corollary 9.2 Suppose that the y -configuration satisfies (8.13), (8.14) and the density satisfies 
Qnix) > Sq for all \x\ < 1 ~ n for some Sq,k > 0. Let ip = ■0^ with j = n — 2,n — l,n he an 
orthogonal function. Then 

sup |V'(a;)p < Cni-*+"'^ (9.10) 

|a;|<l-K 

with a constant C depending on k and 5q. 

Proof. For the case j = 7i — 2, n — 1, the estimate (9.10), even with a better exponent, follows 
from the argument leading to (8.11) from the two assumptions (8.9) and (8.10) with S = 1/4, 
e = 67 and a = 37: 

sup |^j(a;)p < Cni"Hf7 j = n-2,n-l. (9.11) 

|a;|<l-K 

The estimate (8.9) was proven in Lemma 8.1, the estimate (8.10) follows from Lemma 9.1. 

The proof of (9.10) for tp — ipn requires a different argument. Let aj be the leading coefficient 
of the (normalized) j-th orthogonal polynomial, i.e. Pj{x) = ajx^ + .... Observe that Pnix) = 
na„x'^~^ + . . . = n(a„/a„„i)p„_i(x) + . . ., where dots mean a polinomial of degree less than n — 1. 
Thus 



p:,(a;)p„_i(a;)e-"^(-Mx 
Pn{x)p',,_,{x)e-"^^-Mx + I p„(a;)p„_i(x)ny'(x)e-"^(^)dx. 
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The first integral on the riglit liand side vanishes. By the Schwarz inequaUty, we have 



n\ar, 



a„_i 



< / b„(xK_i(a;)ny'(x)|e-"^(-)da; 



< 



p2(a;)e-"^(")d2; 



1/2 



1/2 



(9.12) 



where the second integral was estimated in (8.15). 

Recall the standard three-term recursion relation for orthogonal polynomials 

XPn-l = apn + bpn^i + CPn-2 



(9.13) 



with some real numbers a, 6, c depending on n. By comparing the leading coefficients, we have 
a„_i = aa„ and by ortho normality, we get 



In particular 



1 



a;2|j2_i(a;)e-«^Wda; < 1. 



fln-l 



from (9.12). Hence, from (9.13), 

\Pn[x)\ < |a"^[(a; - b)pn-i{x) - cpn-2{x)]\ < Cn^'^[\pn^i{x)\ + \pn^2{x)\]. 
Using the bound (9.11), we obtain (9.10) for t/j = i/j^ as well. □ 



10 Regularity of Density 

Lemma 10.1 Let L d Q , y € Suppose that the external y- configuration satisfies (8.13) and 
(8.14) and assume that 7 < Then for any k > we have 



sup \g'„{x)\ < Cn^^^l_,ix)+i^lix) + 1) < Cn^-i+i^T 

|X|<1-K 



sup \gn{x) - gsc{y)\ < Cn '^/^^ 

\x\<1-2k 



(10.1) 
(10.2) 



where the constant C depends on k. 

Proof. The derivative of the density can be computed explicitly (see, e.g., (3.63) of [26]) as 



e'^ix)^ / [r{z)-rix)]Kiix,z)dz. 



(10.3) 



In our case 



V'iz)-V'ix) = -Y^ 
11 ^ — ^ 



x-yk z-yu 
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and 



1 



1 



x-yk z~yk (x-yk)^ {z - yk){x - ykf 
From the Christoffel-Darboux formula, we have 



(10.4) 



{x- z)°'K^{x,z)Az 



<C{i;l_^{x)+^l{x)), a = 1,2. 



Since |x| < 1 — k, we can estimate the contribution to (10.3) from the first term in (10.4) by 
1 



E 



n{x - ykY 



{x - z)K^{x, z)dz 



where we have used (8.13) to bound the factor in front of the integral 



sup 



< 



E 



1 



E 



1 



IxlKi-K-;^ n{x- VkY ^"^j^'n-^W-i-ykl ^^^nK\yi-yk\ K^n 
The contribution from the second term in (10.4) is bounded by 



+ 4- < Cn^^- (10-5) 



C 



1 {x~zf 
n ^ {x-ykf J -I {z - yk) 



< 



K'^{x, z)dz 



7=0,1 -^-1 k ' 



yk\ 



V'L,-i(^)d2. 



The integral is estimated as 
1 



^ n ^ — ' 



\z-yk 



ipl+j_T^iz)dz < 



1 nE [h" 



1 



Vkl \yi-yk\ 
1 1 



^l+j_iiz)dz 



(10.6) 



1 + ^ 



?/'„+,_i(z)dz. 



The first term on the right hand side is bounded by Cn'^'^ using (8.13). In the second term, we 
split the integration into two regimes: [z] < 1 — and 1 — n^^ < |z| < 1 with some A > GOB. 
In the first regime, we use the bound (9.10) to obtain CAn^^^^^^^'^logn < C* if 7 < In the 
second regime we use the bound (8.23). This proves (10.1). 

For the proof of (10.2) we use the derivative estimate and the fact that the density is close to 
the semicircle law on scale n~^+'^ as given in (6.34). For any x,y E [— 1 + 2k, 1 — 2k] we have 

Qix) = g{y) + [ g{u)du 
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Taking the average on the interval I = [x — , x + ^^'''], we get 

g{x) - ni-^ g{y)dy\ < n-'+''\\g'\\^ < Cn-^l^+^''\ (10.7) 

Using (6.34), we have 

with /* := T^^{I), where we also used that 

\g,c{T~\x)) ~ Q,M\ < \Iy\ sup \g'{x)\ < CnN-\ 

\x\<2-K 

Combining these inequalities, we arrive at (10.2) and this completes the proof of Lemma 10.1. □ 

11 Proof of the Main Theorem 2.1 

Let V{x) — Uy{x) be the external potential on / = [—1,1] given by (8.2) after rescaling. Notice 
that V is continuous on (—1,1) and lim|^|_j.]^ l/(x) = oo. Let ^{dx) be the equilibrium measure, 
defined as the unique solution to the variational problem 

iirf^iy" J log\s-t\-^iyids)v{dt)+ J V{s)iy{ds)Y (11.1) 

where is the space of probability measures on [—1, 1]. For general properties of the equilibrium 
measure, see, e.g. Chapter 2 of [24] (and references therein) that specifically discusses the case 
of compact interval / and continuous potential going to infinity at the endpoints. We point out 
however, that we follow the convention of [9] and [26] in what we call external potential; the 
potential in [24] and [23], denoted by q{x) and Q{x), respectively, differs by a factor of two from 
our convention: q{x) ~ Q{x) ~ ^V{x). 

The equilibrium measure v with support S{i') satisfies the Euler-Lagrange equations 

j \og\s-t\-^u{ds) + ]^V{t) teS{i^) 

log|s-t|-V(ds) + ix^(t) > C teI\S{i^) 

and S{iy) C ( — 1,1) (Theorem 2.1 of [24]). Moreover, since V is convex in (—1,1) such that 
lim|3.|_j.i V{x) = oo, the support 5'(z^) is an interval, S'(z^) = [a,b], whose endpoints satisfy —1 < 
a < b < 1 and they are uniquely determined by the equations 

' .0, ^r^xM^^i. (11.2) 



a ^{s^a){b-s) 2^ 7a ^{s~a){b-s) 

according to Theorem 2.4 [24] (after adjusting a factor of 2). 

In our case, the potential V and thus the equilibrium measure v depend on n and the external 
configuration y in a non-trivial way. The main result of the recent work of Levin and Lubinsky [23] 
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proves the universal sine-kernel behavior for the correlation function of the orthogonal polynomials 
with respect to a general n-dependent potential. This result fits exactly our situation, after the 
conditions of [23] are verified. 

We recall the main result of [23] in a special form we will need. 

Theorem 11.1 For each n > 1, consider a positive Borel measure /i„ on the real line whose 2n + l 
moment is finite. Let / = [^1,1] cind assume that each fi^ is absolutely continuous on I and they 
can be written as 

Ai„(dx) = Wl''{x)Ax 

where the non-negative functions Wn are continuous on I . We define the potential Q„ = — log Wn : 
I —5- (— OO, +oo] and let f„ be the solution of the variational problem (11.1) with V ~ Vn ^ 2(3„. 
Let J be a compact subinterval of (—1, 1). Assume the following conditions 

(a) The equilibrium measure is absolutely continuous with i-'„(da;) ~ gn{x)dx, where g„ is positive 
and uniformly bounded in some open interval containing J ; 

(b) The family {Q'n}n=i,2,... is equicontinuous and uniformly bounded in some open interval 
containing J; 

(c) The density p„ {x) of the first n orthogonal polynomials with respect to fin on L ( defined in 
(8.7)j satisfies C^^ < gn{x) < C in some open interval containing J; 

(d) The following limit holds uniformly for E Cz J and a in any fixed compact subset o/R; 

hm f'(^) . 1. 

Then for the n-th reproducing kernel of the measure fin on L (defined in (8.5)J we have 

1 a „ b \ sinTrfa — 6) 

lim Kn(E+ 77^, E+ / / 11.3 

n^oc, ngn(E) \ ngni-t) ng„XE) J TT[a-b) 

uniformly for E ^ J and for a, b in compact subsets of M. 



First we verify the conditions of this theorem for our case. We consider the sequence of measures 
on R that vanish outside of / = [—1,1] and that are given by /x„(da;) = e~"'^^^^-'dx on J, where 
y € 3^L is a sequence of good external configurations after rescaling for some L G Q. Recall that 
the concept of good external configurations depends on N, i.e. Q = On and we recall the relation 
(4.15) between n and N. We set J = [— 1 + a, 1 — a] for some ct > 0. The measure fin is clearly 
absolutely continuous (actually it has a polynomial density), and since it is compactly supported, 
all moments are finite. Conditions (a) and (b) will be verified separately in Appendix F. Conditions 
(c) and (d) follow directly from (10.2) in Lemma 10.1. 

Now we start the proof of the main Theorem 2.1. Throughout this proof, E is the expectation 
for the Wigner ensemble with a small Gaussian component, i.e. E = Ej^ with the earlier notation. 
All constants in this proof may depend on k. We will use the results obtained in Sections 4-10. 
In these sections, various small exponents a, /3, 7, e, and various large exponents A, B need to be 
specified. The exponent /3 is given in the theorem and it can be an arbitrary positive constant. The 
other exponents are determined in terms of [3 subject to the following requirements: (3 > lOe + a 
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(6.27), P>{4:A + 8)e + a (Lemma 6.1), Be < 1/2 (Section 7), B > 20 (Lemma 7.1) and A > 60B 
(Lemma 8.1). Finally, 7 < can be an arbitrary positive number, independent of /3. Obviously, 
these conditions can be simultaneously satisfied for any /? > if a, 7, e are chosen sufficiently small 
and A, B sufficiently large. All constants in the proof depend on this choice. 

Let 0{a, b) be a bounded function and S < k/2. In (2.9) we have to compute the limit of 



pf2 rEa+S ^ 

= ^J^ 5 J '^^^^PNi^^^)0[{u-E)Ngsc{E),{v~E)NQsc{E) 

N 1 /■^»+* ^ / 

126 Je 5 '^^^T.O[i>^j - E)Ngsc{E),{Xk- E)Ngsc{E) 



(11.4) 



7V- 
N 



'Eo-S 



(11.5) 



where we have changed variables. Using the form of O given in (2.8), we have 

T(iV,,5)=E5^- / dE g{{X, ~ Xk)Ngs,XE))h[{^^ ~ E)NgUE) 
We first show that 

sup sup T{N,5) < C (11.6) 

S<k/2 NeN 

with a constant depending on k. To see this, let i? be a large number so that g(x) ~ h{x) ~ for 
|a;| > R, then 



T{N, S)<CEJ2t^ dE lll[\X,^E\< CR/N] 

j^k -^Ea-S 
1 pEa+S 

<C— dEEAf'^[E-CR/N,E + CR/N] < C, 

2o Jeh-s 



(11.7) 



lEo-5 

where we have used that ini{gsc{E) : \E — Eq\ < 6} > c > and that 

^^fI < Ck{N\i\)'' (11.8) 

for any interval / of length |/| > 1/A^. The bound (11.8) follows from Eq. (3.11) in [15] after 
cutting the interval / into subintervals of size l/(2iV). 

The estimate (11.6) and similar ideas allow us to perform many cutoffs and approximations. 
For example, we can replace gsc{E) in g and hhy g := gsdEo) in the definition of T{N,S), see 
(11.5), at the expense of an error that vanishes in the limit (5 — > 0. We shall give a proof in case 
we perform the change for, say, g: 

N , ^E„+S 



" 1 rJ^a+o A + Ai 

^ / 5((A, - Xk)Ng,,iE)) - g{{X, - Xk)Ng,,iEo)) h({^^ - E)Ng,,{E) 

f^k Je„-5 V 2 

N 1 pEa+S 

<C<5EV— / AE \\l[\Xi- E\<CR/N]<C5, 



j^k " "'^0-'' e=j,k 
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where we used that g'sdE) is uniformly bounded on [Eq — S,Eo + d] C [—2 + k/2, 2 — k/2]. We will 
not repeat this type of simple argument in this proof. 

After this replacement, we can perform the dE integration using that J h ~ 1: 



r(7V,<5) = E^g((A, - Xk)Ng)- / dE h 



[^^-E)N,)+0i6) 



1 ^ 



(11.9) 



2NgS 



where the last error comes from the contribution of eigenvalues within CR/N distance to Eq ± S. 
With the notation 



Xf — En 



< S 



and using (11.4), we thus need to prove that 



lim lim T*(N,S)^ / g(a-b)\l- 



sin7r(a — 6)\ ^ 



dad&. 



r(a - b) 

Recall the definition of UlsdE) from (4.12) and its inverse function '\ftJ^^{E). Note that 

{m-^^YiE) < CkT^I'^ if -2 + K<E <2- K. (11.10) 

We define 

Xn^eM < 3 < M+), M± ^ N ■ ± <5), 

and write 

l(|A, -Eo\<s'^= xn.EoAj) + Uj, (11.11) 



where Uj is the error term, defined as the difference of 1 ( 
have 

N 



Aj - Eq < S) and XN,Eo,s{j)- We thus 



-^E^.9((A,- Afe)7V^,)xiv.Bo.5(j)l(|Afc -i?o| < s) 



1 ^ 

—£Y,9{{X,^\k)NQ)U,l(\\k-Eo\ <S 



(11.12) 



2NgS 



The last term is bounded by 



E 



-y 



2Ng5 



N 



y l[\Xk-Eo\<S)g{{Xj-Xk)Ng) 

k:k^j 



2NgS^ ^ 

j 



1/2 



(11.13) 



The first expectation is bounded by 

^E l(\Xk - Eo\ < S)l(\\k' - Eo\ < 5)l(\Xj - Xk\ < C/n)i(\X^ - Xk,\ <C/N 



2Ng6 



k,k' ,j 



43 



Splitting the interval [Eq — S — C/N, Eq + S + C / N] into overlapping subintervals of length AC/N 
with an overlap at least 2C/N, we get that this last expectation is bounded by 

Vetv? < C, 

2NgS ^ h - ^ 



where we used the moment bound (11.8) with k = Z and the fact that the number of subintervals 
is CN5. 

Since D^sc is monotonic, the second expectation in (11.13) is bounded by 

n 2 



2NgS 



Xj — Ef) 



< S 



li\m;,Hj/N)-Eo\<6) 



On the set il"^ we estimate the difference of the two characteristic functions by 2, and wc get from 
(4.19) that the contribution is subexponcntially small in n. On the set we can use (4.21) and 
we see that the difference of the two characteristic functions can be nonzero only if 

6 - Cn-'^l^ < \m-^\j/N) ~Eo\<S + Cn'''/^ 

i.e. the number of j's this can happen is bounded by CNn~^/^ . Recalling (4.15), we get 



lim E 



2NgS ^ ^ 



= 0, 



therefore the second term in (11.12) vanishes in the — > oo limit. 



This shows that we can re 



place 1^ 



Aj - En 



< (5 ) by XN,Eo,5{j) in the definition of T* a with 



negligible error and wc can do similarly for k instead of j. Therefore, we need to prove that 



1 



lim lim E 

5-i.o AT-i-oo 2Ngd 



J2 .9((A, - Xk)Ng) ^ g{a- b) 



1 - 



/sin7r(a — b) 
[ n{a~b) 



dadb. 



and without loss of generality, we can assume that g > 0. 
Wc define 



Xr 



J2 g{{X,-\k)Ng). 



L<j,k<L + n 
j^k 



and let 



EX 



We claim that 
1 „ 



2Ngd 



J2 g{{\,-Xk)Ng) = 



M_ <j,k<M^, 



1 + 0(72''-^) 

2NgS 



M-<L<M+ 



(11.14) 



To see this, we consider the expectation value separately on and ff^. Since the double sum 
contains at most terms and P(J7'^) is subexponentially small (4.19), it is sufficient to check 
(11.14) when the expectations are restricted to the set fl. On the set ft we have 

(1-Cn^-i) ^ Xi< 9{{Xj-\k)Ng) <{l + Cn'+^) ^ X^, (11.15) 

M-<L<M+ M_<j.k<M^. M_<L<M+ 

j'^k 
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where C depends on ||5||oo- This follows from the fact that, by the support of g, only those (j, k) 
index pairs give nonzero contribution for which |Aj — A^j < C/iV, and thus |j — fc| < Cn* by 
(4.22). Therefore the sum Xl contains each pair (j, k) at least [n — Cn''']-times and at most 
[n + Cn''']-times. Taking the expectation of (11.15) on we obtain (11.14). 
Since Ql is bounded by using (11.8), and 

lim lim V 1 = 1, (11.16) 

^ M-<L<M+ 

we only have to estimate Ql for a typical L. Additionally to L G {A/_, A/_ + 1, . . . , we can 

thus assume that L € since the relative proportion of good indices approaches one within any 
index set with cardinality proportional with N and which is away from the boundary (see (6.29)). 
More precisely, we fix two sequences L_ (N) and L+ {N) such that L± (N) & G ~ Gn 

Ql^{N) = minjQL, i e Gn), Ql+(n) = max{QL, L e Gn}, 
then it follows from (11.16) that 

(1 - £jV,<5)QL_(Af) < „^ ^ Ql < (1 + £jV,5)<3l+(7V) 

where lim5_,.o limAr_j.oo ea^.s = 0. We thus have to show that Ql±(n) converges to the sine kernel. 
We will actually prove that Ql converges to the sine- kernel for any sequence L = L{N) G G — Gn- 
The dependence on A'' will be omitted from the notation. 
For L Cz G, wc can compute the expectation as 

according to the convention that E = E/^. Recall that definition of the sets fti = ili{L), = 
il2{L) and ^3{L) from (6. 7), (6. 15) and (6.20). Setting fl := nir\il2<^^3, we see that the probability 
of its complement is P(r2'^) < Cn~^ (see (6.8), (6.16) and (6.21)). Since Xl < Cn, we only have 
to consider external configurations such that y g fl. Thus 



Ql =El{y en)Ef^XL + 0{r 



Xl + J (/y - l)XLd^y 

The second term in the square bracket will be an error term since it is bounded by 
El(y en) J l/y - l\\XL\d^ly < CnEl{y e n) J \fy - Ijd^iy. 

Since y E and L E G, we have 

J l/y - IjdMy < 
from (6.26) and (6.27) and wc thus obtain 

Ql=E l(y e n)E,,^XL + O(n-i). 



(11.17) 
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For the main term, by using (7.6) and assuming that B is large enough, we can also replace 
the measure fiy by its cutoff version ^y^-* with a negligible error. Let = p[^^ :~ P^^^i) denote the 

My 

density and py := p /y^ denote the two point marginal of this measure. Thus we have 

fly 



Ql = in-l)Eliy en) da d/3pf (a, - /?)7V^.) + O(n-i). 



(11.18) 



Since /Xy^'' is an equilibrium measure, its correlation functions can be obtained as determinants of 
the appropriate K kernels, see (8.8). In particular 

0<p(^Hu,v) ^ ^pW(^)pa)(^)„ 1 ^, K\u,v) < gy{u)gy{v) (11.19) 
' n ' ' n(n — 1) 

holds for the marginals of the measure ii"y^ ■ The lower bound on p'^^ follows from the fact that K 
is the kernel of a positive operator, i.e. \K{u, w)p < K{u, u)K{v, v). 

Let < K < 1/10. We now show that, up to an error of order k, the da integration in (11.18) 
can be restricted from ly = j/i] onto 



i; = [yl,y*+] := 



1 - 2k, 



y 



\iyly- 



y 



y-i + 2/1 



i.e. onto an interval in the middle of ly with length (1 — 4K)|/y|. Similarly, the d/3 integration will 
be restricted to 



y =[y- ,y+\ 



1 - K 



y 



lyly- 



1 — K , 



y 



y-i + yi 



i.e. onto an interval in the middle of ly with length (1 — 2K)|/y|. We show how to restrict the da 
integration, the other one is analogous. 

The difference between the full da integral and the restricted one is given by 



CnEl(y e O) 



I VI 

da I d(3p'^y\a,(i)g{{a~ (i)NQ). 
y-i 



(11.20) 



To do this estimate, we go back from the equilibrium measure to fy and we also remove the 
constraint il. As above, all these changes result in negligible errors. Moreover, we can insert ft 
at the expense of a negligible error since P(f2) is subcxponentially small. Thus (11.20) can be 
estimated by 



L<j,k<L-\ 



< 2K(AL+„-Ai))+l(A,-AL > (1-2k)(Al+„-Al)) 

(11.21) 



up to negligible errors. On the set we know from (4.22) that 

Ng{XL+n - Al) = n + 0{n^/'), Ng{\j - A^) = (j - L) + 0{n^/^) 
assuming that 7 < 1/20. Thus the first term in the square bracket of (11.20) can be estimated by 

Cn-^Eln Y^l^L < j < L + 2Kn + Cn^/^^g{{\j - Xk)Ng) <Ck (11.22) 

L<k<L+nj^k 
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taking into account (11-8) as before. Similar estimate holds for the second term in (11.21). Thus, 
restricting the da-integration to /* results in an error of order 0{k). 

Doing the same restriction for the d/3 integral, we can from now on assume that both integrations 
in (11.18) are restricted to /* , i.e. it is separated away from the boundary. In particular, from (10.2) 
and after rescaling, we know that gy{ct) and gy{(3) are essentially constant and equal to |/y|"^(l + 
0(n-'^/i2)^ Moreover, on the set h, we know from (6.13) that \Iy\-^ = ^(1 + 0(nT-i/4)), j ^ 



QyW) = ^(1 + Oin-''/'^)), for any /3 e /;* 



(11.23) 



Since /* C /**, the same formula holds for gy{a) for aU a e /*. 

We now compute the restricted integrals in (11.18). Changing variables from (3 to b with 

/3 = a + b{ngy{a))~^ , we have 



Ql =El(y e n) I da- 



l r(y"-a)nQy(a.) 



ngy(a) J (y*_' -a)nQy{a) 



ngy{a)J' \ngy{a)J (11.24) 



+ 0(n-i) + 0(K). 
Since g is smooth and has compact support, we have 

Vnpy (a) / 



Ng 



ngy{a) 



1 



(11.25) 



from (11.23). Therefore, when we insert (11.25) into (11.24) and use (11.19), the error term 
involving ^ is bounded by 



CEl(y CzVl) da 



1 



gy{a) 



d6p(2) (a,a + ^-—) 

^ \ ngy(a)y 



Ng 



(y"-a}ngy{a) 



(y"-a)ney(a) 



db gy\a 



<Cn-'^/i2El(y (^n) da 
Ji* 

<Cn-'^/i2El(y j gy{a)da j gy{[3)dl3 
using that, by definition, 

£iy(a)da < / £iy(a)da = 1 



ngy{a) 
b 

ngy{a) 



1 



(11.26) 



and similar bound holds for the /?-integral. 

Thus we can replace the variable of p in (11.24) by —6 with negligible errors. Now Theorem 
11.1 states that 



1 



gy{a) 



Py^ (a, a 



1 



Clearly, as n — cxj, 



ngy{a) 
(y^* — a)ngy{a) — > ±oo 



sin 7r6\ 2 



7r6 



47 



for all a G /*, i.e. the integration limits can be extended to infinity, noting that g is compactly 
supported. Finally, from (11.23) we have 

/ Qy{a)da>l-0{K)-0{n''-^). 
Combining all these estimates with Theorem 11.1, we obtain 

gi-b) + Oin~-'/'-') + OiK) + o{l), 

where the last term error term is from Theorem 11.1 that goes to zero as — > oo. Taking the 
N — > oo, (5 — > and k ^ limits in this order, we arrive at the proof of Theorem 2.1. r-i 



f°" „ \ /sinTTfex 



A Proof of Theorem 4.1 

We start with the proof of (4.4) and (4.5). From Theorem 4.6 of [15], we have 

P {\m{x + iy)\ >K)< Ce-'^^^'^^y^ 

for all if > sufficiently large, and \y\ > {\ogN)*/N. Since moreover |m(a; + iy)\ < \y\~^ with 
probability one, we obtain, under the assumption N\y\ > (logiV)^, 

E|m(x + ^y)|9 < K"^ + C\y\-'^e^''VK^\ < Cq 

uniformly in N,x. The bound (4.5) follows because ^y{x) = 7r~^Im ra{x + iy). 

To prove the results about the closeness of m(z) or Em(z) to msc{z), we first recall the key 
identity about the trace of a resolvent in terms of resolvents of minors (see, e.g., (4.5) of [13]): 

TV 



k 

with 



fj-ilz) = — Tr ^ = _L \ \ 1) 

' N H-z N -m{z) - z + 6kiz) 

Sk{z) = huk + m{z) - (l - ^) "^^'H^) - X^'^K^). (A.2) 



and 



N - 1 BW - z ' ' N 



Here 5^=) is the (fcfc) -minor of H (the {N — 1) x [N — 1) matrix obtained by removing the fc-th 
row and the fc-th column from iJ), \^a\^^a^ are the eigenvalues and the eigenvectors of B^'^\ and 
a'*^' = (hki, ■ ■ ■ ,hk.k-i,hk,k+i, ■ ■ - hkN)- Throughout the proof we let x,y denote the real and 
imaginary parts of z = .t + iy. Moreover, we will restrict our attention to y > 0. The case y < 
can be handled similarly. 
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Step 1. Lower hound on |m(z) + z\. There exist constants C, c > such that 

^{\m{x + iy) + {x + iy)\<c) <e-^'^ (A.3) 

for all .T e M, y > (log A^)"'/A^, and for all N large enough (depending only on the choice of C, c). 

To show (A.3), we use a continuity argument. We claim that there exist positive constants 
C'l, (^2, C3, c > such that the following four conditions are satisfied: 

inf \z + m^cizM > 2c, 

zGC\[-2,2] V yi - ) 

\m{x + iy)\> — ] < for all x G M, y > (log NV/N 

2c J 3 

sup \5k{x + iv)\> — ]< for ah .t e M, y > [\ogNf/N 

l<k<N tb/ 6 

V{\m{x + iy) - nis^ix + iy)\ > c) < for ah |a;| <l,y> {logNf/N . 

3 

The first condition can be checked explicitly from (4.3). The second condition follows from the 
upper bound in Theorem 4.6 of [15]. The third condition can be satisfied because of Lemma 4.2 in 
[15], combined with the fact that P(maxi, \ht^k\ < (c/48)) < e"'-"^ and with the observation that 



1 



C 



(z)-(l--)mW(z) <— (A.5) 



Ny 



with probability one (see, for example (2.7) in [14]). Finally, the last condition can be ver- 
ified by Theorem 4.1 of [15]. Note that the last three conditions only need to hold for all 
N > No{c, Ci, C2, C3) large enough. Fix C min(Ci, C2, C3). 

For \x\ < 1, y > (logiV)'*/A^ we have (using the first and the last equation in (A. 4)) 

¥(^\m{x + iy) + {x + iy)\ < cj < P([m(.T + iy) - msdx + iy)\ > c) < e^^'^ . 

Hence (A.3) holds true (with the defined constants c, C) for every \x\ < 1, y > (logA^)^/iV . 
Suppose now that (A.3) holds for a given z ~ x + iy Cz C Then we claim that (A.3) holds true for 
all 

z' = x' + iy' e = {z' e C : [z - 2'[ < DN-"^ , Im z' > (logiV)*/iV} 

for a constant D depending only on c, and for all N > Nq; this implies immediately that (A.3) 
holds true for all a; e M, y > (log N)'^/N and > Nq. 

To prove (A.3) for z' £ B^, notice that |m'(z)| < A^^ for all z e C with Im z > (logA^)''/A 
with probability one. Therefore, using (A.3) for z, we find that 

p(|m(z') + z'[ < I) <e~'^^^ (A.6) 

for all z' £ Bz. Expanding (A.l), we obtain that 

N 



m{z') + z' N ^_^m{z') + z' ^ ' m{z') + z' - 5k{z') 
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Therefore, 



1 1 1 / 1 ^ 

'(-—- r > -) <v( \m(z')\ > —"j + P _ 

\\m{z') + z'\ - cJ - y ^ - 2c) 



\5k{z')\ > 1 



\m{z') + z'\\m{z') + z' - 4(^01 ~ 2c 



^{\m(z')\>^)+v{\m{z') + z'\<^)+f'( sup |4(^')l > 
V 2cJ \ 27 \i<k<N 



< 



where we used (A. 4) and (A. 6). This imphes (A. 3) for z' £ Bz, and completes the proof of Step 1. 

Step 2. Convergence to the semicircle in probability. Suppose that |.t| < K, (iog N)^/N < y < 
1. Then there exist constants c, C, 6o, only depending on K, such that 



¥ {\m{x + iy) - rriscix + iy)\ > S) < C e^'^V^!' |2-bll (AJ) 

for all S < So, and all > 2. 

To show (A. 7), we first observe that, by increasing the constant C, we can assume N to be 
sufficiently large. Then we expand (A.l) into 

f f ^ f 1 

m[z) + z iV ^ — ^ m\z) + z m[z) + z — Ok(z) 



We define the complex random variable 

N 

From (A. 3) and since, by Theorem 4.2 of [f5]. 



1 ^ 1 



P ( sup \5k{z)\ > (5 ) < C'e-^™'"^*^''^'^^) (A.9) 

\l<fc<Af / 

for all y > (log N^/N and 6 > 0, we find 

P(|r(z)| > 5) < P(|to(z) + z| < c) + P f sup 14(^)1 > min f^,^)) < Ce'"^^ (A.IO) 

\k<N V 2 2// 

for S small enough, y > (log A^)'^/A^, and N large enough (independently of 6). 
To prove (A. 7) for |a;| < 2, wc use that, from (6.14) in [15], 

CS 



m + z 



< 6 => \m — niscl < 



(2-N)V2 

for all z = a; + with < 2 and < y < 1. This implies, using (A.IO), that 

P (|m(z) - m,c{z)\ >S)<P {\Y{z)\ > c6{2 - \x\y^^^ < Ce^=V^y(2-l^l) 
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for all S small enough, N large enough, < 2, (log N)'^/N < y < 1. 

It remains to show (A.7) for 2 < < K. To this end, for (log N)'^/N < ?/ < 1 and 2 < < isT, 
we consider the event 

n* = {\m{z) + z\> c, sup 14(2)1 < S, \Y{z)\ < Sx/\2 - \x\\\ , z = x + iy . 

^ l<k<N J 



From (A.3), (A.9) and (A.IO), we have P([fi*]^) < e-'=V^y|2-bll for all S small and N large 
enough. Solving (A. 8) for m on the set fi*, we get 



z Y(z) /z2 zY(z) Y(zY 

m(z) ^ — + \/ 1 — ^ . 

^ ' 2 2 V4 2 4 

Since m(z) is the Stieltjes transform of an empirical measure with finite support, it is analytic away 
from a compact subset of the real axis. Similarly, on the set £7*, Y{z) is bounded and analytic 
away from a compact subset of the real axis. The square root in the above formula is therefore 
uniquely defined as the branch analytic on C\(— oo,0], characterized by the property that the real 
part of the square root is non-negative. Hence, on f2*. 



Y{z) z^ zY{z) Y{z) 
m{z) - msc{z) = — + Y ^ - 1 — + 

Y{z) 1 2zY{z)-Y{z) 



2 



2 



2 4./^_^_^^y^^y^ 
using the explicit formula (2.7) for •msc(z\ and therefore 

, „^ |y(^)| , 2|z||r(z)| + |y(z)|2 ^^ |r(z)| + |y(z)p 

\m{z)-'m,c{z)\<——^' , , <G , ,, — (A.ll) 

^ 4Rey'^-l \J\^-\A\ 

using the fact that 



Re y — - 1 > C|2 - |a;|| for all 2 < |x| < K, \y\<\. 

From (A.ll), we obtain that 

P (|m(z) - msc{z)\ > 5) < P (|r(z)| + |y(z)|2 > cl^j\2-\x\^ < e-'^WNv\2-\x\\ 

for all 5 small enough, 2 < |x| < K, (log N)*/N < y < 1, and N large enough. 

Step 3. Fluctuations ofm{z). Suppose that |x| < (logN)'^/N <y<l and Ny\2 - \x\\ > 
(log A^)^. Then there exist constants C, c > such that 



P (|m(z) - Em{z)\ > S) < Ce-"VJVy|2-|:i|| ^^ -^2) 

for all < 6 < Sq, with Sq small enough and all N large enough. 
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To show (A. 12), we observe first that 

\Em{z)-msciz)\<E\m{z)~'m,,ciz)\< / di P (|m(z) - msc(2)| > t) 



where we used that |'ti(2;)| < y ^- Using (A. 7), we obtain 



\Em{z) - mseiz)\ < ^ , , + -e^V ^'y\'-\-U < ^ (A.13) 

VNy\2-\x\\ y ^Ny\2-\x\\ ^ ' 



for N large enough. For 5^ Ny\2 — \x\\ > 4C wc thus obtain 

P(|m(z)-Em(z)| > (5) < p(|m(z) - m,e(z)| > ^) < C g-'^V^i'l^-l^ll 

where we used (A. 7) again. For S^yNy\2 — \x\\ < AC the bound (A. 12) is trivial. 

As a consequence of (A. 12), we immediately obtain (4.7). If Ny\2 — \x\ \ < (logiV)*, we directly 
use (4.4). Otherwise, we use 

E \m{z) - Em(z)|'? < C, / " tP-^¥{\m{z) - Em(z)| > t)dt + Cy-«e-"''»V^J'|2-l^ll 



from (A. 12), and we obtain the first term on the r.h.s. of (4.7). 

Step 4- Convergence to the semicircle in expectation. Assume that \x\ < K, {log N)'^/N < y < 1 
and Ny\2 - |x|| > (log TV)''. Then 

'^-(-)---(-)'^ iV,|2-N|3/2 (A.14) 

for a universal constant C. Note that this bound gains an additional {Nr])~-^/^ factor on the 
precision of the estimates compared with Step 2 and Step 3, but the negative power of |2 — |a:;|| has 
increased. 

To prove (A.14), with cq := inf^ \msciz) + z\ > 0, we have 

C 

\Em{z) + z\> \msc{z) + z\ - \Em{z) - msc{z)\ > cq - > ^ (A.15) 

^/Ny\2-\x\\ 2 

for N large enough (here we used (A.13)). Expanding the denominator in the r.h.s. of (A.l) 
aroimd Em(z) + z, wc find 



m(z) = — — — -= — — -TT (m(z) ~ Em(z) + Sk(z)) 

^ ' Em{z) + z TV ^ (Em(z) + z)2 ^ ^ ' w -r /.v 



fc=i 

N 



(A.16) 



(r.(.)-E..(.) + 4(z))-. 



iV ^ (Em(z) + 2)2 ^ " m{z) + z-Sk{z) 

Taking expectation, we find 

Ern(z) + z (Em(z) + z)^ 



1 



(Em(z) + z)2 



{m{z) - Em{z) + 5i{z)f ^ 



m{z) + z — '^i(^) 



(A.17) 
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With a Scliwarz inequality, we get 



Em(z) 



1 



< 



1 



|Em(z) + z|2 
+ 2 



|E(Si(z)| 



m(2)-Em(0)|4+E|,5i(z)|4)'/' 



From (A. 12). we find 



|Em(z) + z\ 



E|m(z) -E77i(z)|'^ < 



E 



|m(z) + z — (5i(z)p 



Ca 



{Ny\2 - IxW)"/^ 
for arbitrary q > 1. Moreover, with c fixed in (A. 3), we have 

P (|to(z) " ' ' ^ 

using (A. 9), and hence 

1 



1/2 

(A.18) 
(A.19) 



+ z - 5i{z)\ < 2 j < n\rn{z) + z| < c) + P (\6i{z)\ > | 



< e 



< e 



-C(log Ar)2 



E 



|m(z) + z - (5i(z)|« - 2/9 



1 o 99 99+1 



(A.20) 



if N is large enough. Here we used the fact that Im m{z) + z — Si{z) > Im z = y. From (A. 9), we 
also have 

From the definition of Si{z) in (A. 2), from EAT'^'^) ~ and from (A. 5), we get 
Combining this bound with (A.19), we find, from (A.18), that 



(A.22) 



Eto(z) 



Em(z) 



< 



C 



Recall that msdz) solves the equation 

msc(z) 



1 



rrisciz) + z 



Ny\2- 



= 0. 



(A.23) 



This equation is stable in a sense that the inverse of the function m — > + (to + z) ^ near zero 
is Lipschitz continuous with a constant proportional to |2 — Thus we obtain 



|Eto(z) - msc{z)\ < 



C 



Ny\2^\x\\y^ 



and this completes Step 4. 



Step 5. Alternative bound on |Eto(z) — TOsc(z)|. Assuming < K and (log A'^)^/A^ < 2/ < 1, 
there exists a constant C > such that 



|Em(z) - msc(z)| < 



C 



7Vy3/2|2- |a:||i/2 



(A.24) 
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for all N large enough (independently of z — x + iy) . 
To prove (A. 24), we use the bound 

E|m(z) -Em(z)|'' < 



(Ary3/2)9 



(A.25) 



which is valid for all g > 1 and it follows from Theorem 3.1 in [13]. Expanding again the denomi- 
nator in the r.h.s. of (A.l) around Em(z) + z, we get 



m{z) 



1 1 ^ 

- Iv 

N ^ 



m{z) — Em(z) + 



E?n(z) + z iV ^ [Em(z) + z] [ni{z) + z- 4(z)] 



1 1 ^ 



•m[z) — Em(z) 



m(z) + z N ^ [Em(2) + z] [ni{z) + z- 4(z)] N (Eto(2) + z) 



1 ^ 
"/V 5Z 



4(z) 



(A.26) 



N 



N [Em{z) + zf [m{z) + z ~ Sk{z)] 



Taking the expectation, we find 
Em(z)- ^ 



l,m[z) + z 



< 



r^T r (lE,\miz) -Em{z)fy^^ (e 



\Em{z) + z\ 

|EJi(z)| , (£1-51(^)1^ 



1 



\m{z) + z- (5i(z)|^ 



1/2 



|Em(z) + z|2 |Em(z) + z|2 \^ \m{z) + z ~ 6i{z)\ 



E- 



1 



1/2 



(A.27) 



E\Si{z)\^y^'^ (E|m(z) - Em(z)|^ 
|Eto(z) + zp 



,1/4 



E 



1 



|m(z) + z — (5i(z)p 



1/2 



Using (A.20) with g = 2, (A.25) with g = 2 and g = 4, (A.21) with q ^ i and (A.22), we find, by 
the stability argument, that 



Em(z) 



1 



< 



C 



im{z) + z ~ Ny^/^ 
which implies (A. 24). This completes the proof of Theorem 4.1. 



□ 



B Proof of Proposition 4.2 

We start with the proof of (4.14). From the moment method, we know that if Ainin(-H^) and 
^miix{H) denote the smallest and the largest eigenvalues of the hermitian Wigner matrix H, and 
if K is large enough, then 

P(A,nin(i^) < -K) = P(A„,ax(i^) > K) < A'"^^''' 

(for example, one can use the bound ETriJ^ ' < C from [28]; the symmetry condition on the 
distribution can be removed by symmctrization). This implies that DT(£') < NK~'^^ ^ for E < 
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-K and 1 - m{E) < NR-'^^^"' for all E > K . Therefore 

/oo pK 
\m{E) ~ msciE)\dE < \[m{E)-m{-K)]-msc{E)\dE + 2NK-''^ (B.l) 
oo J-K 

for K > large enough. The last term is negligible. The main estimate is contained in the following 
lemma whose proof is given at the end of this section. 

Lemma B.l Let g* — ~ p_ be a difference of two finite measures with support in [^K, K] for 
some K > 0. Let ^ 

m*{z)= ( ^*{E):= [ g*{<lx) 

J-g X — Z J-K 

he the Stieltjes transform and the distribution function of g* , respectively. Denote moreover by 
rrij- {z) the Stieltjes transforms of g*^ . We assume that m* , m*^ , ml satisfy the following bounds 
for \x\ <K + 1: 

\ml{x + iy)\ + \m*_{x + iy)\ < Li for all (log N)'^ / N < \y\ < 1 (B.2) 

K(^ + *y)l<]^^^37i|^^ ^"""^^ (iogA^m<H<i (B.3) 

\m*ix + iy)\ < ^1^113^' |^||3/2 for all 7V|2/| |2 - |x| | > (log iV)^, (B.4) 

with some constants Li, L2, L3. Then 

/K 
^\m*{E)\AE<j^. (B.5) 

with L ~ maxjii, is}- The constant C in (B.5) depends only on K. 

We apply this lemma for the signed measure g*{Ax) = 1(|2;| < K)\^g{x) — gsc{x)\Ax. The 
bounds (B.2), (B.3), and (B.4) follow from (4.4), (4.8) and (4.9) (choosing K + 1 instead of K), 
respectively. From (B.5) we obtain 



-K 

which, together with (B.l), completes the proof of (4.14). 

For the proof of (4.13), we fix \E\ < K and we choose N~^/^ < 77 < 1 to be optimized later. 
Define a function / = Z^; : R — >• R such that f{x) = 1 for x < E — rj, f{x) = ioi x > E + t] with 
l/'l < Ct]-^ and |/"| < C77-2. We have 

(B,6) 

The second term on the r.h.s is estimated by Crj, using (4.5). For the first term we use Theorem 
4.6 of [15]: 



^(m^^lElA > s/4) < Ce~^^ (B.7) 
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with some positive c > 0. 

Now we consider the fluctuation of the smoothed distribution function 



N 



/oo 1 
fEWHdX) - g{X)dX] ^^Y. [M^-^) - E/£(Aa)]. 

We partition [—K — 2, K + 2] into intervals 1^ of length ?/. For Af > Mg with a sufficiently large 
Afg, and set 

Qk ■■= |fcA/A^?7 < sup7V(/r) < {k + l)MNr]Y fc = 0, 1, 2, . . . , 
then from Theorem 4.6 of [15] we know that 



Analogously to the calculation (D.16), the size of the variance of W is determined by the size of 
iVVFj. On the event 51^, we have 



dW |2 



N ^ I aRe h 



|2 


dW 


+ 


dim hij 



l<i<j<N 

2 



l<i<j<Af Q 



„ kM 

|2 <• 



(Note that the derivative in T/W is with respect to the original random variables Zij = VlVhij). 
From the concentration inequality (Theorem 2.1 of [4]) we obtain that 

nW > S/4) < e-^*/4 E e^'^ 
i/n 

<Ce-^s/i ^ gST^fcMTV-^-i (B.9) 

fc=0 
k=0 



Choosing T = cN^/"^, and -q = N^'^/*, it follows that 

¥{W > (5/4) < e 



Repeating the same argument with W replaced by —W, we conclude that 

P(|M^| > (5/4) < e-"''^. 
Combining this with (B.6) and (B.7), we have 

P(|0^(£;) - msciE)\ >S)< Ce-""^^. 
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which completes the proof of Proposition 4.2. 



□ 



Proof of Lemma B.l. For simphcity, in the proof wc omit the star from the notation. First 
notice that (B.2) impUes that, after taking imaginary part, 



m)<cL,\i\ 



(B.IO) 



for any interval of length |/| > {\ogNf/N, I C [~K - 1, if + 1]. 

Let (logiV)^/A^ < 77 < 1 to be chosen later. Fix E e [—A', A'] and define a function f = fs '■ 
R R such that f{x) = 1 for 2: g [-K, E - rj\, f{x) = for x > A + 77 and x < -K - 1 with 
l/'l < C?7~i and |/"| < C77-2. We have 



/OO 
fE{\)Q{d\) <\g\{E-r],E + r])<CLir]. 
-00 



(B.ll) 



To express /(A) in terms of the Stieltjes transform, we use the Helffer-Sjostrand functional calculus, 
see, e.g., [8]. Let xiv) be a smooth cutoff function with support in [—1,1], with xiu) — 1 foi' 
\y\ < 1/2 and with bounded derivatives. Let 



then 



- 2^ 



dgf{x + iy) 1 

— axay ~ — 

^2 A — X — ly 2tt 



f{x + iy)^ifix)+iyf'{x))xiy), 

W.f"{x)x{y)+i{f[x)+iyf'{x))x'{y) 



X — X — iy 



dxdy, (B.12) 



and therefore, since / is real. 



/(A)^,(dA) 



Re 



< 



f{\)Q{A\) 

) 

/ yf"{x)x{.y)^'m.m(x + iy)(lxdy 
+ C I {\f{x)\ + \y\\f'{x)\)\x'{y)\\m{x + iy)\dxAy. 



(B.13) 



Using (B.3) and the support properties of x' and /, the second contribution is bounded by 

^^+'' dx 



As 



dx 



\x\<K+l 



, [\I{x)\ + \y\\f'{x)\] ^ CL2 CL2 

<\y\<i ^ iV|2;|3/2|2-N|i/2 - TV + iV77 

CL2 



E-ri 



11/2 



(B.14) 



<- 



■iV|2- |A||i/2 



For the first term in (B.13), we split the integration: 
yf"{x) x{y) Imm(a; + iy)dxdy 



< C 



\y\<V^\x\<K+l 



\y\\f"i.x)\{\lmm+{x + iy) \ + \Imm-{x + iy)\)dxdy (B.15) 



yf"{x) xiy) Mx + iy)dxdy 



';<|y|<l "'|an|<-ff+l 
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where, in the second term, we dropped the imaginary part since / and x S'l'^ real. To bound the 
first term we note that, for every fixed x, the functions 



\y\\lmni±{x + iy)\ = j p±(ds)- 



[s — xY + y^ 

are monotonicahy increasing in \y\. This imphes that, for all \y\ < 77, 

|?/||Imm±(2; + iy)\ < T]\lmm±{x + iri)\ < Crj 
by (B.2). Therefore, wc find 

\y\\f"{x)\{\Imm+{x + iy)\ + \lmm-{x + iy)\)dxdy < CLn]. (B.16) 

\y\<vJ\x\<K+i 

As for the second term on the r.h.s. of (B.15), we integrate by parts first in x, then in y. It is 
sufficient to consider the regime t] < y < 1, the case of negative y's is treated identically. We find 



/ / iyf"{x)x{y)m{x + iy)dydx 

Jn<v<i J\x\<K+i 



(B.17) 



iy f {x)x{y)m' [x + iy)dyc\x 

■n<y<l J\x\<K+l 

dy{yx{y))f'{x)m{x + iy)dydx 

'n<y<l J\x\<K+l 

77/'(x)x(?7)™(a; + ir/)dx. 

\x\<K+l 

Using (B.2), the second term is bounded in absolute value by 

CLi I T]\f{x)\dx<CLiri. (B.18) 

J\x\<K+l 

The absolute value of the first term on the r.h.s. of (B.17) is estimated by 

Cr]-^ / / \m{x + iy)\dxdy. (B.19) 

J 7} J E — 7] 

Putting all terms together, we find from (B.ll), (B.13), (B.14), (B.16) and (B.18) that 

/ \'n{E)\dE <CL(r] + N-^ + f dy f dx \m{x + iy)\) . (B.20) 

J\E\<K ^ Ji] J\x\<K+l ' 

We will use the bounds (B.2)-(B.4) and we split the integration into separate regions: 
dy / da; \'m(x + iy)| 

J\x\<K+\ 

(logA^)' 



<CL J dy J dx l(^|2 - \x\\ < 



Ny ) (B.21) 
CL I dy I dx min ( 1, ■ 



|2;|<A'+1 



iV|2/|3/2|2- |a;||V2' 7V|y||2- |a;||3/^ 



CL (I + II) . 
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Clearly 



Ny J - N 

As for the second term on the r.h.s. of (B.21), we divide the integral into several pieces: 



II < / dy 



f 1 

da; + / dx j j — --r-r^ 

|2-|x||<(Ary)-2/3 J(Ary)-2/3<|2-|.T||<K+3 Ny\2-\x\\' 



dy 



< 



Ar-4/7 



da; 



1 



da; ■ 



\2-M<y^/- Nyy^\2 - \x\\^/^ Jyi/2<\2-\,\\<K+3 Ny\2-\x\\y^ 



1 



+ 



dy- 



1 



(B.22) 



independently of rj. Inserting in (B.20), and choosing rj = N we conclude the proof of (B.5). 



C Proof of Lemma 4.3. 

We partition the interval [—2 + k, 2 — k] into a disjoint union of intervals 

I. 



n<N~^{r~-),n^N~^{r+-) 



2'-- ■ 2'J (^-^^ 

of length rCN~^ and center w,. = rrC N^^ , where r € Z, \r\ < ri := Nn~'^{2 — k). Then, for any 
r, 

|AA(/,) 



gsc{wr)\ < n '^l^ 



(C.2) 



by (4.16). To prove (4.21), first we locate middle eigenvalue. Let rg be the index such that 

N 



r<ro 



in other words 



^Njl e iro- (C.3) 

For definiteness, we can assume that ro > 0. Using the second event in (4.18) we obtain that 

rn-l 



E -^(^'■) ^ - A/'[(-oo, 0)] < CiVn' 



7/6 



(C.4) 



T = l 



On the other hand, with the notation r\ := min{(ro — l)+,iV7i '''}, we have by (C.2) that 



ro-l 



(C.5) 
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where we used that Wr < 1 for any r < ri < Nn and thus gsc(w,.) > £'sc(l) > c. From (C.4) 
and (C.5) we conclude that vq < CNn^'^'^'^ , i.e. Wr„ < Crr'il'^. Thus we proved that 

|u.,J <Cn-T/6, IAat/sI < Cn-'/e. (C.6) 

Starting the proof of (4.21), wc can assume that a > N/2 by symmetry. Suppose first that 
Xa £ [^2 + K, 2 — k], i.e. Xa £ Ir for some \r\ < ri, i.e. a > N/2 implies r > vq. Then we have 

7'— 1 r 
u—ro + l u—rQ 



r-1 



^i^u) <a^N f Qsc{E)dE < ^{lu) (C.7) 

u=ra + l J-oo u=ro 

using QsciEyiE = 1/2. 
Note that 

^ ^f{Iu) < n^(l + ^ < _^ Cn-^/6)Ar / g,,(£;)d£; (C.8) 

u—ro u—ro ^ 

using (C.2) (C.6) and that 7 is small. Similarly 

^ AA(4) > (1 - Cn-^/6)7V / ^7,,(^;)d^; - Cn\ (c.9) 

«=ro + l 

Thus, combining these estimates with (C.7), we have 



Qsc{E)dE 

-00 



\^-^{aN-^) -Wr\< CK-^l'^n-^l'' 

using (11.10) and k^/^ < aN^^ < 1 - K^/a gi^ce Xa S /r, i-e. |Aa - Wr\ < n'N~'^, we obtain 
(4.21). Finally, we consider the case when Xa > 2 — k. The lower bound in (C.7) and the estimate 
(C.9) hold with r = ri so we get 



a > (1 - Cn-'^'^)N / Qsc{E)dE ~ Cv? 

(CIO) 

> iV / Qsc{E)dE - CrT^'^N > (1 - ■k-^h^'^)N - CrT^'^N, 



which contradicts the assumption a < N{1 — tc"/"^) for large N . 

For the proof of (4.22), suppose that Xa G Ir, Xb £ Is- Using (4.21) and Nk^/^ < a < b < 
N{1 — K^^^), we know that —2 + k/2 < Wr < Wg < 2 — k/2. By (4.21), we have the apriori bound 

\Xa - Xb\ < \m-^\aN-') - m-^\bN-')\ + C^vT-'l^ < C^N-'\b - a| + C^n-^/e < C^n^^/^ 
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by the assumption |6 — a| < CNrP^^ . In particular 

The constants depend on k as < Ck}!"^ . 
From Aa G /r, Afc G Is it also follows that 



and 



[s-r- \)n'^N-^ < A;, - Aa < (s - r + 1)71'' iV~^ 



(C.ll) 



(C.12) 



(C.13) 



Let s — r + 1 = X^j'lo j < mi < ... be the binary representation of 



r + 1 with 

jo — [log2(.s — 7' + 1)] < logiV. Using this representation, wc can concatanate the intervals /„, 
r < u < s, into longer intervals Jq, Ji, . . . of length \ Jj\ ~ ^^rON^^ such that 

s io 

U—T j — Q 

Since g'g^ is bounded on /, we have 

Qsciw) < (1 + Ci^\I\)gsc{Xa), for any w e I. 
On the set fl we thus have (see (4.16)) 

s io jo 



J2mu) ^Y.^{j,) < {i+cM\)Qsc{Xa)J2N\j,\ i + (iv|j,i)- 



l/4jj7/12 



3=0 



3=0 



(C.14) 



< 



(1 + CMDQsciK) [n\I\ + (7V|/|)3/4n^/i2 log TV 



Similary, one can get a lower bound on X^u^r+i -^i^u)- Recalling |/| = (s — r + l)n^N ^, and that 
|/| < fj.oj^ (C.ll), we conclude from (C.13) that 

(b-a)- g.MaW{s - r)| < C^n^\b ~ aj^/^ + C^N-^\b - + C^n-^ . 

But from (C.12) 



Ngsc{Xa){Xb - Aa) - gsciXaWis - r) < C^n^ 



thus 



\Ngsc{\a){Xb -Xa)-{b-a)\< C^n^\b - aj^/^ + C\N-^\b - a\' 
with Ck < Ck^/^, and we have proved (4.22). □ 
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D Proof of Lemma 5.3 



We start with the outhne of the proof and indicate the origin of the restriction a > 1/4. We 
will first regularize the logarithmic interaction on a scale 77 at the expense of an error of 0{ri) for 
each pair of eigenvalues, modulo logarithmic corrections (Lemma D.l). By a Schwarz inequality 
(D.18), the fluctuation of the regularized two body interaction is split into the product of the 
fluctuation of the regularized potential (D-14) and the fluctuation of the local semicircle law 
regularized on scale 77. The latter is of order 0{N~^^'^ri~^^'^) by the improved fluctuation bound 
on the local semicircle law (4.7). The former is of order 0{N~^ri~-^/^) using that the logarithmic 
Sobolev inequality (2.4) on the single site distribution can be turned into a spectral gap estimate 
for Ax- Finally, we optimize the regularization error 0{r]) and the fluctuation error 0{N' 
per particle pairs, which gives a total error of order ■ N~^^^ = iV^+^Z^. 



-3/2„-n 



The proof of the following regularization lemma is postponed until the end of the section: 



Lemma D.l Let {log N)'^/N <r/<l, then 



k + ivl 



< C-q logN 



(D.l) 



'j<k ]<k 
with respect to any Wigner ensemble whose single-site distribution satisfies (2.6) and (2.5). 

Then Lemma 5.3 directly follows from the following statement: 
Lemma D.2 Suppose rj = iV~'^/'*, then 



^E5^1og|A.-A, 



. , 1 

3 + m - 2 



\og\x - y \ gscix)gsciy) dx dy 



< C 



AogN 
77374 



(D.2) 



for a universal constant C > and all N large enough. 



Proof of Lemma D.2. Recall that uj{dx) denotes the empirical measure of the eigenvalues (4.1). 
We have 

^log |Aj - Aj +Z77I - — / log\x - y + ir]\uj{dx)uj{dy) <A^|log77| 

because of the contribution of the diagonal terms. 

Step 1. Recall the dcflnition of CL>^(a;) from (4.2), then 

E / \og\x-y + ii^\uj{dx)uj{dy)-E / dxdy\og\x - y + ir]\uj^{x)uj^{y) <Cr]{\ogNf . (D.3) 



To prove (D.3), we observe that 

Lo{dx)Lo{dy) log \x - y + i-q\ - I dxdy log \x - y + i7]\ujjj{x)uj,^{y) 



= / w(da;)w(dy) / dtdr 



(t — x)^ + rp (r — yY + if 



(log — y + ir/l — log \t — r + iri\) 
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Clearly 

E J uj {dx)uj{dy) log \x — y + ir]\ — E J dxdylog |a; — y + i7y|w,j(a;)a;^(j/) 
+ C77|log77|. 

(D.4) 

Here we also used that Pjsuppo; e [— i^T, A']} > 1 — e^*^^ for some large constant K. Next we 
observe that 

dMr .f-/ |log \x-y + iTj\- log \t - r + ir,\\ 

7o y (i-a;)2+772 (r - y)^ + i]^ \s{t - r) + (1 - s){x - y) + iri\ 

Jo J V + l r- + 1 |s77(i - r) + (a; - + Z77| 

Inserting this bound back into (D.4), we find 



E j uj{dx)uj{dy) log \x — y + iri\ — E j dxdylog\x — y + i'i]\uJri{x)LUri{y)^ 



<Crj\logv\+Cvl ds |dMr(K| + M)Ml^i|ZLiE^^ 
< C77(log7V)^ 



(t2 + l)(r2 + 1) Ar2 ^ |A, - + sri{t - r) + m/| 



(D.6) 

Here we used the bound 

which holds uniformly in x G M, if 77 > (log Af)"'/A^. To prove (D.7), consider the event 

60 = {max|Aj | < ATo} (D.8) 

3 

for some Kq > 0. Moreover, define the intervals Ik = [—{k + l)rj,—krj] U [krj, [k + 1)?]], for all 
nonnegative integer k < Kq/t], and consider the event 

Qi^Wh <KNrj, k = 0,l,2,...,Korj-'}. (D.9) 

For sufficiently large Kq and K we have 

F(eE;) < e-"^^^o, P(ej) < e^"^-"^^ (D.IO) 



63 



by Lemma 7.4 [13] and by (4.20), after adjusting c. Then 



— y 



- m + a;| + 1)77 



(|fc-m + a;| + l) 



<C|log77| 

(D.ll) 

because iVjy > (logiV)^ by assumption. This completes the proof of Step 1. 
Step 2. Let g,]{x) = Kuj,-i{x), and assume (Nrj) > (log A^)^, then 

/ 1 ^1/2 



E / \og\x - y + ir]\ujrfix)uj^{y)dxdy - / log\x - y + iri\grf{x)grf{y)dxdy 



< C 



(D.12) 



We note that 

E [ \og\x - y + i-q\ujr^{x)ujr^{y)dxdy - j \og\x - y + if]\Q.rj{x)Qr^{y)dxdy 

log \x-y + i'q\{iOrj{x) ~ Qr^{x)){iOrj{y) ~ Qrj{y))dxdy (D.13) 
= E y da; {A^ - EA^) (w„(.t) - ft,(a;)), 
where we defined the random variable 

Ax-= I dy\og\x-y^ir^\uj,^{y)^^^f,^{Xj~x) (D.14) 

with/„(A) = (log|-|*0^)(A). 

To estimate the fluctuations of we use that the logarithmic Sobolev inequality (2.4) implies 
the spectral gap, i.e., we have 

E|^:, -EA^P < S-ElV^^p. (D.15) 
Let Ua denote the orthonormal set of eigenvectors belonging to the eigenvalues Aq, of H . Taking 



64 



into account the scaling (2.1), we have 



l<i<j<N 







2 

+ 


dA^ 


2 




dRehij 




dluihij 





a,f3 



c 



using that \f^{X)\^ < C{X^ +r]^)-^. We have from (D.15), (D.16) and (4.5) that 



C 

1p^' 



(D.16) 



(D.17) 



On the other hand, from (4.7) and w,,(x) = tt ^Im m(x + iri) we have 



E|Lj„(a;)-^)„(x)|« < 



(A^?7|2- |x||)«/2 



+ C,1(A^7,|2- |x|| < {\ogNf) 



for all q > 1 and for |.t| < K with some large constant K . 

In order to insert this estimate into (D.13), we need to extract the necessary decay for large x 
from io,-i{x) — Qriix). For |a;| > 2A'o sufficiently large and for any 5 > 1 we can estimate 



Ec.^(x) <77-''El(|A,„ax| > |x|/2)+El(|A,„ax| < \x\/2) [^Y. 



(Ac -a;)2 + r/2 



< rj' 



\x\^i - I^P? 



Inserting the last three equations into (D.13) with g = 2, we find 



E / \og\x - y + iT]\uj,j{x)uj,j{y)dxdy - / \og\x - y + iT]\g,j{x)gr^{y)dxdy 



1/2 



l^nix) - gr,{x)\' 



< y da: [E\A,-EA,[ 

CKq C(logjV)^ 
^3/2^+ ^2^3/2 ■ Ar,,i/2 



1/2 



< 



C 



dxy^E Lo^ix) 



(D.18) 



< 



C 



dx 

This completes the proof of Step 2. 
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step 3. We have 

dxAy log \x-y + J?y|£',,(a;)ft,(y) - / AxAy log \x - y + ir]\ Qsc{x)Qsc{y) 



< CN-^/'^ + C-q. 

(D.19) 



To prove (D.19), we write 

AxAy \og\x - y + ir]\Qy^{x)Qy^{y) - I dxdy \og\x - y + Qsc{x)Qsc{y) 



AxAy log |x - y + i-q\{Q,^{x) - gsc{x))Q,^{y) 
+ / d.Tdy log \x-y + iv\0sc{x){gr,iy) - gsdv)) 



(D.20) 



To control the first term on the r.h.s. of the last equation, we recall that 



mix) = ^ E AA(-oo; x), m^cix) 



gsc(t)dt 



denote the expected number of eigenvalues up to x normalized by TV (integrated density of states) 
and the distribution function of the semicircle law. Note that ^{x) — Vtsc{x) vanishes at a; = ±00. 
Introducing *Tt,,(x) := grj and integrating by parts we find 



dxdy log |x - y + ii]\{g,j{x) - gsc{x))g{y) 



dxdy \og\x ~y + — (ai^(x) - 91sc(a;)) g{y) 
dx 

dxdy-^^-^^ (m^ix) - 91,,(x)) g{y) 
[x — yY + 

d.TRe ETO(a; + i7?) {^^{x) -^scix)) 



(D.21) 



< sup E |m(a; + ny)! / da: D'I,,(a;) — 91sc(a;) 

X J 

From the upper bound (4.4) on |E m(a: + iry)| and from 



dx 



msc{x)-{'yisc*e^){x] 



we find, by (4.14), 

dxdy log |x - y + ir]\{g^{x) - gsc{x))g{y) 



<Ct] + C dx 



m,j{x)^{msc*o^){x)\ 

<Crj + C J dx\%x) - m^cix)] 

The second term on the r.h.s. of (D.20) can be bounded similarly. This completes the proof of 
Step 3. Combining the estimates in Step 1-3 and choosing ry = N~^/^, we finish the proof of the 
Lemma D.2. q 
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Proof of Lemma D.l. We split the summation into three parts: 



log |Aj - Afc + i8\ = Yi (S) + ¥2(6) + ¥3(6) 



for any Q < 5 < rj with 







- Afe 1 > 77) log 


\\j-\k + i5\ 










y2{5) 




^"<|A,-A.. 


< 7j) log|Aj - Afe - 
















)log|A-, - Xk +iS\ 




0<k 







(D.22) 



We have 



E |yi(ry) - Yi(0)| < E ^ 1(|A, - Xk\ > v) f' ds 

<E^1(|A,-Afc| >7,) / d,s— 
77i. J a |Aj 



log |Aj - Afc + isri\ 



j<k 
j<k 



Xk + isr]\ 



(D.23) 



\Xj - Afe + zjyl 



< CN'^r]\logN\ 

by (D.7). For the ¥2 term, we remark that, for arbitrary < 6 < r/, 

|l2(<5)|<Clog7V^l(|A,-Afe|<7y). 

]<k 



(D.24) 



To bound the r.h.s. we consider the events 9o, 6i from (D.8), (D.9) with sufficiently large K and 
Kq so that (D.IO) holds. Then 



E\¥2iS)\ <CN^ (log N) (P(eEj) + p(ej)) 



C(logiV)E 



1(60 n 61 ) ^ A/}, (AA,,_, +Nk+ A/},+, ] 



fe=0 



(D.25) 



< CN^ilogN) e-=(i°s^)' +C{logN)r]'\Nijf 

< CN^ (log N)tj 



for every < 6 < rj. Finally, for the Y3 term we use the level repulsion estimate (E.5) from Theorem 
E.3, which implies that for any interval I = [E ~ e/N, E + e/N] with E eM. and < e < 1 



Let 



Jr 



[Ml > 2) < Ce^iV^^ 
r — 1 r + 1 " 



r e Z 
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be overlapping intervals covering M. We can then write 

■)— 771— 1 



{■I III, J. 'I IIL -v 

j<k rGZ m=0 

We split the interval J,- into overlapping subintervals of length 2^'"^^'^ N~^^ by defining 



r- 1 



r- 1 s + 2 



< s < 2™+! - 2 



Then 



TV 



10 



< I A, - Xk 



< 



s=0 

(2"i-i7V9)4 



< C2-''"Ar 



3m Tvr— 18 



For large \r\ > KN^'^, we can also use the bound 



(D.27) 



P{Aj e /J < Ccxp ( - cN{N~^°rf), 

that follows from the trivial large deviation estimate for the largest eigenvalue (Lemma 7.4 [13]). 
Inserting these last two estimates into (D.26), we have for every < 6 < r] 



E|r3(<5)| < CiV^ ^ (to + log iV)[ 2-3™TV-i**+ cxp{-cN{N-^"r)^) 

m=0 |r|<r* |r|>r* 

< CiV' 2 (log iV), 



(D.28) 



where r* ^ K \og{m + 2) for brevity. Combining (D.23), (D.25), and (D.28), we obtain (D.l). 
□ 



E Level repulsion near the spectral edge 

We need to establish a Wegncr-typc inequality, and bounds on the level repulsion in the same 
spirit as in Theorem 3.4 and Theorem 3.5 of [15], for energy intervals close to the spectral edges. 
Since we only need these bounds for very small values of £ ~ N^"', we are not aiming at the most 
general result here. The statements we present can be proven by simply replacing, in the proof of 
Theorems 3.4 and Theorem 3.5 of [15], the convergence to the semicircle law stated in Theorem 
3.1 of [15] with Theorem 4.1. Recall that Theorem 3.1 of [15] is valid up to the smallest possible 
scale rj > K/N but only away from the spectral edges, while Theorem 4.1 holds all the way to the 
spectral edges, but only up to the logarithmic scale ry > (logiV)"'/iV. A better A^-dcpendence of 
the bounds in the following theorem (but a worse ^-dependence) can be achieved by following the 
dependence on k of the constants in Theorem 3.1 of [15]. 

All statements assume the conditions (2.4)-(2.6). We introduce the notation that [x]+ denotes 
the positive part of a real number x. 
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Theorem E.l (Gap distribution) Let H he an N x N hermitian Wigner matrix and let \E\ < 2. 
Denote by the largest eigenvalue below E and assume that a < N — 1. Then there are positive 
constants C,D,c,d such that 

p(a„+i -E>^,a<N-l)<C e-'=[2-l^^ll^'^^ (E.l) 

for any N>1 and any D{\ogNY/{2 - \E\) <K< nNd. 

Proof. The proof of this theorem can be obtained following the proof of Theorem 3.3 in [15], 
making use of Theorem 4.1 instead of Theorem 3.1 of [15] (in order to follow the |2— dependence 
of the probability). More precisely, we observe that the event \a+i—E > K/N implies that there is 
a gap of size K/N about the energy E' = E + K/{2N). Choosing M = for a sufficiently 

large constant D > 0, and 77 = K/{NAP) > (logTV)^, we find, similarly to (7.3)- (7.4) in [15], that, 
apart from a set il'^ of measure < Ce^'''^ , 

N 16 16-v/k 
Imm(£:' + nf) < — < — —, 

which implies, for sufficiently large D, that 

\miE' + irj) ~ m,,{E' + irj)\ > ^ (E.2) 

5 

where Co ~ ttq{E') > c^/k. The theorem then follows because, by Theorem 4.1, the event (E.2) 
has probability 

P (\miE' + irj) ~~ mscXE' + ir;)| > ck^/^) < Ce"""'''^^ < Ce"^"'^'^ . 

□ 

Theorem E.2 (Wegner Estimate) Let i? e M and set k := [2 — |i?|]+. There exists a constant 
C > such that for the number of eigenvalues Mi in the interval I = [E — e/ (2N); E + e/{2N)], 
we have 

Ce(logiV)4 



EAfi < — ^ \ (E.3) 



for every E Cz M. and £ < 1 . Moreover 



C(logiV)4 



y E\m(x + iy)r < ——^ — (E.4) 



for all X eR, y > 0. 



Theorem E.3 (Level Repulsion) Let E E M. and set k := [2 — |i?|] + . There exists a universal 
constant C such that for the number of eigenvalues Mi in the interval I = [E — e/ (27V); E-\-e/{2N)\ 
we have 

P(M>2)<^-'"°«"'.: (E,5) 

for all _E G M, all < e < 1, and all N large enough. 

Proof. The proof of Theorem E.2 and Theorem E.3 follows exactly the proof of Theorem 3.4 
and, respectively, Theorem 3.5 in [15], after replacing Theorem 3.3 of [15] by Theorem E.l above 
(in order to follow the dependence on the distance from the edges). q 

Note that the results of the last three theorems are only useful in the regime of very small 
e^N\I\ < (logiV)-4. 
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F Properties of the equilibrium measure 

Here we check the conditions (a) and (b) in Theorem 11.1. The main ingredient is the foUowing: 

Lemma F.l Let L Cz G and y G 3^l- After rescaling, then for any fixed a > with J' = 
[— 1 + ct/2, 1 — (j/2], the first and second derivatives of the potential are uniformly bounded on J' , 
i.e. 

1,2, (F.l) 



sup (x)| < 
xeJ' 



where the constant is independent of y. Furthermore, the endpoints a, h of the support of the 
equilibrium measure v ~ Vy satisfy 



l|,|fo-l| <Cn-"'/^\o: 



•gn. 



(F.2) 



Condition (b) of Theorem 11.1 is given now by (F.l). To see condition (a) of Theorem 11.1, let 
[inj bn] denote the support of the equihbrium measure zy„, then a„ — ^ — 1 and — ^ 1 as n — > cx), 
thus Qn is positive on J = [—1 + cr, 1 — cr] for any fixed a > and any sufficiently large n. 

For the uniform boundedness of gn{x) on J, we use the explicit formula (see, e.g. Theorem 2.5. 
of [24]): 

5n(^) = T!^^{x-a.,,){bn-x) P.V. / " ™ ^=^_= ds, (F.3) 



v/(s -««)(&„ -s) 

where P.V. denoted principal value. For sufficiently large n and for any x €z J the singularity of 
{s — x)~^ is uniformly separated away from a„ and b„, i.e. from the singularity of the square roots. 
Moreover, V^{x) is a smooth function inside (—1, 1) with 

supsnpK{x)\ + \V:{x)\<C. 

according to (F.l). Thus the uniform boundedness of .g„ on J follows immediately from (F.3) with 
standard estimates on the principal value. 

Proof of Lemma F.L Recall the definition El = ^jJ^{LN~^) from (6.11). For y S we know 
from the first bound in (6.13) that dist{Ly, El) < CrT'^l^, and from (4.22) that 



Vk = y-i 



with go := Qsc{El)^ assuming 7 < 1/20 and Cn < \k\ < < N-^^^. After rescaling, this 
corresponds to 

^ ' Qo-=Qsc{El), (F.4) 



Vk 



ngo 



and we will drop the tilde for the rest of this proof. This bound on the location of yk 's will be used 
to estimate the derivatives of Uy. For £ ^ 1,2 and x E J' we have 



if/f(x)i<- y: ' 



\k\<Cn ' 



< Ca- 



C 
n 



E 



Cn<k<n^ 



E 

Cn<\k\<n'^ 
1 



1 



[x - Vkf 
1 



{x - VkY [x - y-kY 



(F.5) 
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For ^ = 2 we can use in the second sum that \x — y±k\ > C'fc[ngo] ^ for fc > Cn by (F.4), thus 
\Uy{x)\ < C(cr). For £ = 1 we esthnate 



x-yk X- y-k 



2x - yk - y-k 



{x - yk){x - y-k) 



- P "^'o 



where we used (F.4) and k > Cn. After summation we conclude that |C/y(a;)| < C(cr) and thus 
(F.l) is proven. 

To estimate the location of the endpoints, we substitute V{x) = Uy{x) into the equations 
(11.2). We have 



E 



ds 



^is-a){b-s)s-yk 
^ ^ L ^{s^a){b-^s-yk " 







|fe|<r 



(F.6) 



(F.7) 



We will need the following explicit integration formulae for a < 6 (see, e.g. Formula 2.266 in [17]) 



1 



ds 



a ^is~a){b-s)s-y ^{a-y){b-y) 

1 ds TT 



a ^{s-a){b- s)s-y ^{a~y){b-y) 



if y < a <b, 
if a < b < y. 



ds 



la y^{s-a){b-s) 
With these formulae, (F.6) and (F.7) can be written as 

1 ^ V ^ 

i<k<nB ^{a-yk){b-yk) 



- E 



n ^{a-yk){h-yk) ^ 



-nB<fc<- 



1 

n 



E 



Vk 



-nB<k<- 



1 ^ \J [a ~ yk){b ~ yk) 



1 



- y \^ 



yk 



1 



-1. 



(F.9) 



(F.IO) 



(F.ll) 



Using the bound (F.l) on the location of yfc's, we replace the limit —n^ < k with —Y < yk and 
the limit k < with yk < F in the summations in (F.IO) and (F.ll), where Y := n^^^QQ^- We 
have, for example, for the first sum (F.IO), 



- E 

n ^ — ' 



-7iB<k<- 



_i \J {a - yk){b - yk) 



-Y<yk< 



.1 V(a - yk){h-yk) 



+ 0(n-i/5r-i), (F.12) 



and the estimate for the other three sums in (F.IO), (F.ll) is identical. 

With similar argument, we can remove the j/fc's that are too close to [—1,1]. Let X = n^~^, 
then 

C I ^ 1 CjiT-i 



< - 



E 



< 



-1-X<yk< 



_i ^/{a-yk){b-yk) ^{a + l){b + iy 
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where, for the lower bound, we used that = —1, while for the upper bound we used that the 
number of yk's in [—1 — X, — 1] is at most Cn"' (see the third set in the definition of (4.18)) . 
Similarly we have 



1 



< 



-^<i y — 



and for the sums in (F.ll) 



< 



1 



V(a + l)(fo+l) « 
C 



E 



-Cn'<- 



< 



-1-X<yk< 
1 



Vk 



C 



n\J\ - b n 



- E 



l<yk<l+X 



Vk 



Define 



m:=^ E 

'Y<yk<- 



V{a-yk){b-yk) 

- T. 

n ^—i 



- 1 



< 



nVo+T 



v/(i-^)(i-«) 



1 



E 



Vk 



\J{o. - yk){b-yk) 



- E 



V(a - yk){b - yk) 
Vk 



l+X<yk<Y 



V(a - yk){b-yk) 



-1 



(F.13) 



1 (F.14) 



to be the truncated summations. Combining the above estimates with estimates of type (F.12) 
and using B > 2 so that rCY-^ < n^~^, we get from (F.IO), (F.ll) that 



C 



iVT^ v/(a + l)(fo+l) 



C 



^{l - b){l - a) nVa + 1 



and 



C 



C 



l^/a + l n-y/l — b 



v/(a + l)(6+l) ^{l-b){l-a) 



(F.15) 



(F.16) 



Using that for y £ the number of eigenvalues in any interval of size at least rC N ^ (before 
rescaling) is approximated by the semicircle law with a precision nT'^ 1"^ (see (4.16)), we get 



-i-x 



dy 



Y V(« - y)ib - y) J^+x \/{a-y){b-y) 



dy 



+ 0{n-^/^logn) (F.17) 



and 



W2 = Qo 



-i-x 



Via - y)[b-y) 



i+x 



Here we also used that 



sup 



dy 



a.be[-i.i]Ji+x \J{a-y){b- y) 



Via - y){b- y) 

< C\ogn 



:-l 



d?/ + l + C'(n-''/Mog7i). 

(F.18) 
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and 



sup / 

a,he[-l,l] Jl+X 



- 1 



< Clogn. 



Let u = ^(a + &) and v = ^(6 — a) and we can assume, by symmetry, that u > 0. Then we can 
change variables in the integrals in (F.17) 



Wi = go 



dy 



dy 



Y-u 

-Y-u \Jy'^ — Ji+x-u \Jy'^ — 



0[n-^l'^ logn) 



dy 



dy 



'Y-u \Jy'^ — 

The first term is of order Y^^ and thus negligible. Thus, from the lower bound in (F.15), we have 
li+x-u ^/y^~v^ ~ V(a + 1)(6 + 1) n^l - b 



Qo 



+ Cn-^l'^ log 



Estimating y"^ ~ < {1 + X + a){l + X + b) on the integration domain, we get 

Cugo . Cn'^^^ 



< 



y^{l + X + a){l+X + b) y^{a + l){b+l) 

X 



+ Cn-''/^\ogn, 



Clearly a + 1 > 2m, thus 



Cugo < Cn''-^ { 1 + \ + Cn-'^l'^ log 

V a + 1 / 



Cugo <Cn^"^(l + ^) + Cn^'^/^ logn. 



from which it follows that 



u < Cn-'^/^logn 

if 7 < 1/2. The case u < is treated similarly, thus we have shown that 

|a + 6| < Cn-'^/^logn. 



(F.19) 



Now we consider the W2 and assume again that u > 0. With the same change of variables as 
above, we have 



W2 =go 



=ga 



-l-X-u 



Y-u 



Y-u 



y + u 
y 



dy - go 



dy - 



Y-u 



Y-u 



1+X-tt 



y + u 

y/y^~V^ 

y 



- 1 



where we used (F.19) and 

r-\-X-u 



1 



-Y-u \f\F-^ 



-.dy < Clogn and 



Y-i 



+X-U \/y'^ -v^ 



dy + 1 + 0(n"''/3 logn) 

dy + \^0(n-^l'^(^ognf), 
(F.20) 



dy < Clogn. 
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The integrals on the r.h.s of (F.20) can be exphcitly computed: 



l+X-u 



y 



\-x- 



Ay = 



1 



^(1 + X + u)^ -v^ + 1 + X + u 



thus we have 



W2>1~ 2gov^ - Cn-~^/^{\ognf ~ CY-^ > 1 Cn-^^^{lognf 

TT 

by using that < 1 and £»o < tt^^ (see (2.7)). Combining this estimate with the upper bound in 
(F.16), we have 



1 - - - Cn-^/^(logn)^ < __£ r'' ' + , . , . 

^ " ^{a + l){b+l) ^{l-b){l-a) ~ a + 1 1-b 



< 



by using a < b. Therefore either a + 1 or 1 — 6 is smaller than Cn^ ^, but then by using (F.19) we 
obtain that both of them are smaller then Cn"''/'^ log n. This completes the proof of Lemma F.l. 



□ 
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